[C++][Parquet] Regression reading byte-stream-split encoded floats with null values in Arrow 16.0.0 #41562
jorisvandenbossche added the "Priority: Blocker" (marks a blocker for the release), "Critical Fix" (bugfixes for security vulnerabilities, crashes, or invalid data), and "backport-candidate" labels, and removed the "Critical Fix" label, on May 7, 2024.
@adamreeve thanks a lot for the report. Can confirm this locally with latest main as well (on Linux / Ubuntu).
I understand the cause here and will fix it soon.
I've submitted a basic bugfix. I'm a bit busy during work hours and will add a test after 9 pm (UTC-8) when I'm back home.
pitrou pushed a commit that referenced this issue on May 7, 2024:
…amSplitDecoder (#41565)
### Rationale for this change
This problem is raised from #40094. Original bug fixed here: #34140, but this is corrupt in #40094.
### What changes are included in this PR?
Refine checking
### Are these changes tested?
* [x] Will add
### Are there any user-facing changes?
Bugfix
* GitHub Issue: #41562
Authored-by: mwish <[email protected]>
Signed-off-by: Antoine Pitrou <[email protected]>
Issue resolved by pull request 41565
raulcd pushed a commit that referenced this issue on May 8, 2024 (commit message identical to the one from May 7, 2024).
vibhatha pushed a commit to vibhatha/arrow that referenced this issue on May 25, 2024 (same commit message, with references written as apache#41565, apache#40094, apache#34140, and apache#41562).
Describe the bug, including details regarding any error messages, version, and platform.
Write byte-stream-split encoded floats containing null values:
And then attempt to read the data back:
The above code works with pyarrow 15.0.2 but fails with pyarrow 16.0.0 with the following exception:
Writing the data with pyarrow 15.0.2 and reading with pyarrow 16.0.0 also fails, but writing with 16.0.0 and reading with 15.0.2 works fine. Disabling byte stream split encoding or not writing any nulls also makes the error go away.
This looks related to #28737 although the error there was quite different.
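For readers unfamiliar with the encoding, here is a minimal pure-Python sketch of byte-stream-split for 32-bit floats. It assumes the Parquet format's behaviour that nulls are carried separately (in definition levels) and only non-null values are encoded into the byte streams. The function names `bss_encode` and `bss_decode` are hypothetical and are not Arrow APIs, and the size check is only an illustration of how counting rows instead of encoded values could reject a valid page that contains nulls. Whether this is the exact check the fix refined is an assumption, not something the thread confirms.

```python
import struct

WIDTH = 4  # bytes per float32 value


def bss_encode(values):
    """Byte-stream-split encode the non-null float32 values (hypothetical helper)."""
    # Nulls are skipped: only non-null values end up in the encoded data.
    raw = [struct.pack("<f", v) for v in values if v is not None]
    # Stream k holds byte k of every value; the streams are concatenated.
    return b"".join(bytes(r[k] for r in raw) for k in range(WIDTH))


def bss_decode(data, num_encoded):
    """Decode `num_encoded` float32 values from byte-stream-split data."""
    # The data size must match the number of *encoded* (non-null) values.
    if len(data) != WIDTH * num_encoded:
        raise ValueError(
            f"expected {WIDTH * num_encoded} bytes for {num_encoded} values, "
            f"got {len(data)}"
        )
    stride = num_encoded  # length of each byte stream
    return [
        struct.unpack("<f", bytes(data[k * stride + i] for k in range(WIDTH)))[0]
        for i in range(num_encoded)
    ]


values = [1.0, None, 2.5, None, -3.0]
data = bss_encode(values)
num_non_null = sum(v is not None for v in values)

# Decoding with the non-null count round-trips the encoded values.
decoded = bss_decode(data, num_non_null)
print(decoded)

# Decoding with the total row count (nulls included) fails the size check.
try:
    bss_decode(data, len(values))
except ValueError as exc:
    print("size check failed:", exc)
```

With no nulls the two counts coincide, which would be consistent with the observation that the error disappears when no nulls are written.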
Component(s)
C++, Parquet