CSV.Rows with CodecZlib Out-of-Memory #476
Comments
Sorry, I hit the submit issue button too early. I've updated the main text.
Thanks for the report; we can be smarter about what we're doing here.
@cpfiffer, I have a PR up here; it'd be great if you could try it out for your use-case (you can get that branch by doing …)
… using mmapped buffers to slurp IO objects, which should be a little easier on overall memory. Now, this is an ok short-term fix, and is definitely smarter for the CSV.File case, but we really should find a true buffering solution for CSV.Rows, since the whole zen there is a low-memory footprint (#477)
Seems to be working on my side with …
Great!
I'm reading some very large gzip files, where I want an iterator for each row. I have something similar to this:
I get an OOM error for fairly large files:
Any thoughts on what to do here?
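For anyone hitting the same thing, here is a minimal sketch of a streaming workaround that avoids loading the whole decompressed file at once. It is not the code from this report and not how CSV.jl works internally: it reads the gzip stream line by line using only CodecZlib and Base. The helper name `foreach_gzip_row` is hypothetical, and the comma splitting is deliberately naive (no quoted fields, no embedded newlines), so treat it as an illustration of the low-memory pattern rather than a CSV parser.

```julia
using CodecZlib  # assumed to be installed; provides GzipDecompressorStream

# Hypothetical helper: call `f(header, fields)` once per data row of a gzipped,
# comma-separated file. Only one line is held in memory at a time.
# Assumption: no quoted fields and no newlines inside fields.
function foreach_gzip_row(f, path::AbstractString)
    open(GzipDecompressorStream, path) do stream
        header = split(readline(stream), ',')   # first line holds the column names
        for line in eachline(stream)
            isempty(line) && continue            # skip blank lines
            f(header, split(line, ','))
        end
    end
end
```

A caller might use it as `foreach_gzip_row((header, fields) -> println(fields[1]), "data.csv.gz")`, where the file name is a placeholder. For real CSV data with quoting, a proper parser such as CSV.Rows is still the right tool once it can buffer the stream.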