-
Notifications
You must be signed in to change notification settings - Fork 367
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Poor readtable speed #942
Comments
What happens if you turn off the garbage collector with |
it doesn't speedup loading. |
Try |
Sorry but I haven't found what should be imported to have this function available. |
|
I was able to load this entire dataset with |
on my side julia> using Base: readdlm
julia> @time dat=readdlm("AUDUSD-2014-01.csv", ',');
76.044753 seconds (31.17 M allocations: 1.012 GB, 78.82% gc time)
julia> @time dat_no_sep=readdlm("AUDUSD-2014-01.csv");
15.322299 seconds (15.58 M allocations: 590.372 MB, 60.08% gc time)
julia> @time dat_csv=readcsv("AUDUSD-2014-01.csv");
16.317755 seconds (31.17 M allocations: 1.012 GB, 47.67% gc time) |
|
Hello,
I try to read 1 month of tick data of AUD/USD
Sample data can be found here
https://drive.google.com/file/d/0B8iUtWjZOTqla3ZZTC1FS0pkZXc/view?usp=sharing
see also pydata/pandas-datareader#153
AUDUSD-2014-01.zip
is a 11M file and containsAUDUSD-2014-01.csv
which is a 85M filewhich is not so big!
With Python / Pandas
With Julia / DataFrames.jl / readtable
see also JuliaLang/julia#16015
Kind regards
PS: use
to have column name set correcly
use
to convert to DateTime
The text was updated successfully, but these errors were encountered: