Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parquet: write column_orders in FileMetaData #5158

Merged
merged 1 commit into from
Dec 5, 2023

Conversation

Jefffrey
Copy link
Contributor

@Jefffrey Jefffrey commented Dec 2, 2023

Which issue does this PR close?

Closes #5152

Rationale for this change

What changes are included in this PR?

Populate column_orders in Parquet FileMetaData when writing files.

Are there any user-facing changes?

@github-actions github-actions bot added the parquet Changes to the parquet crate label Dec 2, 2023
Comment on lines +334 to +337
// This field is optional, perhaps in cases where no min/max fields are set
// in any Statistics or ColumnIndex object in the whole file.
// But for simplicity we always set this field.
let column_orders = Some(column_orders);
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment on lines +327 to +328
// Currently only supported ColumnOrder is TypeDefinedOrder so we set this
// for all leaf nodes.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In future there may be new supported order, see here: apache/parquet-format#221

@tustvold tustvold merged commit f16d2f5 into apache:master Dec 5, 2023
16 checks passed
@Jefffrey Jefffrey deleted the parquet_write_col_order branch December 5, 2023 12:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
parquet Changes to the parquet crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Parquet: ColumnOrder not being written when writing parquet files
2 participants