We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I have seen an issue when reading a Parquet file created by Hudi. There is a nesting list as below:
optional group a (LIST) { repeated group array (LIST) { repeated int32 array; } }
The C++ parquet reader infers its schema as array<struct<array:array<int>>>. The root cause is here:
array<struct<array:array<int>>>
arrow/cpp/src/parquet/arrow/schema.cc
Lines 657 to 663 in 12dddfc
I think we need to regard them as a nesting two-level list, meaning that the correct interpretation is array<array<int>>.
array<array<int>>
C++, Parquet
The text was updated successfully, but these errors were encountered:
apacheGH-43994: [C++][Parquet] Fix schema conversion from two-level e…
606a18a
…ncoding nested list
7e6cb6d
array<array<int>>?
Sorry, something went wrong.
1bc2d11
6e312ec
98b4efd
wgtmac
No branches or pull requests
Describe the bug, including details regarding any error messages, version, and platform.
I have seen an issue when reading a Parquet file created by Hudi. There is a nesting list as below:
The C++ parquet reader infers its schema as
array<struct<array:array<int>>>
. The root cause is here:arrow/cpp/src/parquet/arrow/schema.cc
Lines 657 to 663 in 12dddfc
I think we need to regard them as a nesting two-level list, meaning that the correct interpretation is
array<array<int>>
.Component(s)
C++, Parquet
The text was updated successfully, but these errors were encountered: