Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Avro block decompression #5306

Merged
merged 3 commits into from
Jan 17, 2024
Merged

Avro block decompression #5306

merged 3 commits into from
Jan 17, 2024

Conversation

tustvold
Copy link
Contributor

Which issue does this PR close?

Part of #4886

Rationale for this change

Adds support for block decompression for common codecs.

What changes are included in this PR?

Are there any user-facing changes?

@github-actions github-actions bot added the arrow Changes to the arrow crate label Jan 16, 2024
pub enum CompressionCodec {
Null,
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

None of this is public yet, so this isn't a breaking change

@@ -55,7 +56,7 @@ impl Header {
/// Returns an iterator over the meta keys in this header
pub fn metadata(&self) -> impl Iterator<Item = (&[u8], &[u8])> {
let mut last = 0;
self.meta_offsets.windows(2).map(move |w| {
self.meta_offsets.chunks_exact(2).map(move |w| {
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a silly bug, that caused it to fail to read kv pairs after the first

Deflate,
BZip2,
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I opted to just support the codecs that overlapped with parquet for now, to avoid any net new libraries to this project

arrow-avro/src/reader/mod.rs Outdated Show resolved Hide resolved
@tustvold tustvold merged commit 202e9a1 into apache:master Jan 17, 2024
23 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
arrow Changes to the arrow crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants