-
Notifications
You must be signed in to change notification settings - Fork 160
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when reading data from Delta Lake table on S3 #2292
Comments
Thanks @Sanjay-M! We're taking a look at this now :) |
Could you also supply the plan that is printed with |
@jaychia It can build the logical plan but throws an error while building the physical plan. |
Got it, thanks! Other information that would be helpful for debugging:
It would also be super helpful if you could share the output of |
Yes, the delta table is partitioned.
I thought the error could be due to partition column values being null so I tried to replace them with NA
When I tried to to_pandas() after replacing NULL values, I got the below error
|
Hey @Sanjay-M ! Just merged a fix for this, it should be ready in the next release. |
Describe the bug
Error while reading the data on Delta Lake on S3 with daft and it is not able to generate a physical plan.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Expect it to convert the data frame to pandas or materialize it in the local
Information
Additional context
The Python Delta Lake library can read the data properly.
df.explain(True), df.collect(), df.to_pandas() gives error but it works with df.limit(1).to_pandas()
Error Log with RUST_BACKTRACE=full
The text was updated successfully, but these errors were encountered: