[js] Are there plans to support Parquet as a data format? #2209

Closed
eode opened this issue Jul 2, 2018 · 6 comments

Comments

@eode

eode commented Jul 2, 2018

The title pretty much says it all. I'm using JavaScript and would like to be able to read and write (mainly read) Parquet data. I know it isn't currently available, but is it a planned feature?

@wesm
Member

wesm commented Jul 2, 2018

It's a good-sized project because of the dependency stack involved (Thrift, various compression libraries) and because Parquet itself is a somewhat complex file format. I opened https://issues.apache.org/jira/browse/ARROW-2786 so the feature can be tracked there.
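
To illustrate where Thrift enters the picture: a plain Parquet file starts and ends with the 4-byte magic `PAR1`, and the 4 bytes before the trailing magic hold the little-endian length of the Thrift-encoded footer metadata. The sketch below only locates that footer in an in-memory buffer. It does not decode the Thrift payload or decompress any pages, which is where most of the real work lies. The names `locateParquetFooter` and `FooterLocation` are hypothetical and are not part of Apache Arrow or any existing library.

```typescript
// Hypothetical helper for illustration only; not an Apache Arrow API.
const MAGIC = [0x50, 0x41, 0x52, 0x31]; // ASCII "PAR1"

interface FooterLocation {
  offset: number; // byte offset where the Thrift-encoded footer starts
  length: number; // footer length in bytes
}

// Returns true if the four bytes at `offset` equal "PAR1".
function hasMagic(bytes: Uint8Array, offset: number): boolean {
  return MAGIC.every((b, i) => bytes[offset + i] === b);
}

// Locates the footer metadata of an unencrypted Parquet file held in memory.
// Returns null if the buffer does not look like a Parquet file.
function locateParquetFooter(bytes: Uint8Array): FooterLocation | null {
  // Smallest possible layout: leading magic + footer length + trailing magic.
  if (bytes.length < 12) return null;
  const end = bytes.length;
  if (!hasMagic(bytes, 0) || !hasMagic(bytes, end - 4)) return null;

  // The footer length is a 4-byte little-endian integer before the trailing magic.
  const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
  const length = view.getUint32(end - 8, true);
  const offset = end - 8 - length;

  // The footer cannot overlap the leading magic.
  if (offset < 4) return null;
  return { offset, length };
}
```

Everything after this step (parsing the Thrift structures, then decoding and decompressing column chunks) would still need a Thrift reader and codec implementations such as Snappy.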

@wesm closed this as completed Jul 2, 2018
@eode
Author

eode commented Jul 3, 2018

Thanks. There's some existing work under an MIT license or similar variant here. It doesn't work for me: it attempts to compile LZO, and I don't seem to have the right dev environment configured for that. Still, it might be a useful start or have some useful tidbits.

@wesm
Member

wesm commented Jul 5, 2018

Ah, interesting. LZO isn't really used anymore; once Snappy came around, people largely stopped using it because of the *GPL licensing issues.

@ali-habibzadeh

ali-habibzadeh commented May 4, 2021

Was there any progress made on this?

@westonpace
Member

> Was there any progress made on this?

None yet. The JIRA is still open, and there is some discussion of alternatives (e.g. third-party Parquet libraries, or using the Rust Parquet implementation via wasm), but as far as I can glean from the mailing list, no one is working on a native JS Parquet implementation within Apache Arrow. There is ongoing JS work being discussed on the mailing list (I believe they even had a virtual meetup in February), so if this is something you are interested in working on, I would recommend reaching out on the dev mailing list.

@eatonphil

Others have reported that this library supports in-browser reading of Parquet files: https://github.com/kbajalc/parquets
