Unable to retrieve object: only valid on seekable files #401
Comments
Hi, this seems to be related to a corrupted Parquet file. Have you managed to trace which S3 object is failing specifically, and have you tried opening it to verify that it is valid?
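One quick way to run that check on a downloaded copy of the suspect object is to look at the Parquet framing: an unencrypted Parquet file starts and ends with the 4-byte magic `PAR1`, and the 4 bytes before the trailing magic hold the footer length as a little-endian integer. The sketch below is only a rough sanity test under those assumptions; the function name `check_parquet_framing` is made up for illustration and is not part of the project, and passing this check does not prove the file is fully readable. Because the footer sits at the end of the file, the check needs a seekable stream, which is why a non-seekable one is rejected up front.

```python
import io
import struct

PARQUET_MAGIC = b"PAR1"  # leading and trailing marker of an unencrypted Parquet file


def check_parquet_framing(stream):
    """Return (ok, reason) for a binary stream (hypothetical helper).

    Only the outer framing is inspected: leading magic, trailing magic,
    and a plausible footer length. The footer itself is not decoded.
    """
    if not stream.seekable():
        return False, "stream is not seekable"

    size = stream.seek(0, io.SEEK_END)
    # Smallest conceivable layout: magic (4) + footer length (4) + magic (4).
    if size < 12:
        return False, "file too small"

    stream.seek(0)
    if stream.read(4) != PARQUET_MAGIC:
        return False, "missing leading magic"

    # The last 8 bytes are: footer length (uint32, little-endian) + magic.
    stream.seek(-8, io.SEEK_END)
    tail = stream.read(8)
    if tail[4:] != PARQUET_MAGIC:
        return False, "missing trailing magic"

    footer_len = struct.unpack("<I", tail[:4])[0]
    if footer_len == 0 or footer_len > size - 12:
        return False, "footer length out of range"

    return True, "ok"


if __name__ == "__main__":
    # Assumes the suspect object was first downloaded to a local path
    # (the path below is a placeholder, not a real file from this issue).
    with open("suspect.parquet", "rb") as f:
        print(check_parquet_framing(f))
```

If the framing looks fine, the next step would be opening the same local copy with the Parquet library the ingestion pipeline uses, since that also decodes the footer and the column data.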
What version were you using before updating to v0.66?
We were using v0.64 before updating to v0.66.
That's strange, because the only work we have done since v0.64 was improving performance on JSON. I see from the filename that this issue relates to an object that was created recently. Can you confirm that you didn't change anything in the ingestion mechanism, for example by using different versions of pandas or similar libraries to produce the Parquet objects?
Hi @matteofigus, I went back and confirmed with the team that we have made no recent ingestion changes; the only change was upgrading s3f2 to v0.66.
Hi @matteofigus, since the recent upgrade to v0.66, our Deletion Jobs have been failing with a FORGET_PARTIALLY_FAILED error. Looking into the logs, the ObjectUpdateFailed error is "Unable to retrieve object: only valid on seekable files". Do you have any leads on what could cause this error?
We are using the fix in backend/ecs_tasks/delete_files/parquet_handler.py as mentioned here
Any advice?