execution::context::sql should support creating partitioned dataset #1220
Comments
Related: #1139
Auto-detecting common partitioning schemes seems like a good idea to me, @jimexist. Something else I have been wondering is "how general do we want our partitioning to be?" What @rdettai has implemented is the classic Hive partitioning, as used when partitioning by date. But there are other ways to partition data (e.g. IOx has its own way to partition data into individual files, but the partition metadata is stored in an in-memory catalog). It would be pretty cool to re-use all the partitioning infrastructure (as well as, for example, add more sophisticated partition pruning).
#1185 is also a follow-up to #1139 that is closely related to this. Maybe we can merge the two issues and create subtasks? @alamb my idea was that each standard/technique for getting the list of files (table catalog) should be a different provider.
Duplicate of #1185
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
execution::context::sql should support creating partitioned datasets. For now, in
https://github.com/apache/arrow-datafusion/blob/75b8112ee33af81d6085be4a83a096bf965dbc89/datafusion/src/execution/context.rs#L187-L225
the value passed for table_partition_cols is empty.
Describe the solution you'd like
Ideally we should keep the same syntax, but auto-detect a partitioned dataset based on whether location is a file or a directory.
Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
Additional context
Add any other context or screenshots about the feature request here.
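To make the proposed auto-detection concrete, here is a minimal sketch using only the Rust standard library. It assumes Hive-style `key=value` directory names and a local filesystem. The function names (`partition_cols_from_path`, `first_file`, `detect_partition_cols`) are hypothetical and are not part of the DataFusion API; a real implementation would presumably go through the object store and table provider abstractions instead.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Hypothetical helper: collect Hive-style partition column names from the
/// directories between `root` and `file`,
/// e.g. `root/year=2021/month=10/a.parquet` gives ["year", "month"].
fn partition_cols_from_path(root: &Path, file: &Path) -> Vec<String> {
    // Fall back to the full path if `file` is not located under `root`.
    let relative = file.strip_prefix(root).unwrap_or(file);
    // Only directory segments can be partitions, so drop the file name.
    let dirs = relative.parent().unwrap_or_else(|| Path::new(""));
    dirs.components()
        .filter_map(|c| c.as_os_str().to_str())
        .filter_map(|segment| segment.split_once('='))
        .filter(|(key, _)| !key.is_empty())
        .map(|(key, _value)| key.to_string())
        .collect()
}

/// Hypothetical helper: depth-first search for the first regular file under
/// `dir`, visiting entries in sorted order so the result is deterministic.
fn first_file(dir: &Path) -> io::Result<Option<PathBuf>> {
    let mut entries: Vec<PathBuf> = fs::read_dir(dir)?
        .map(|entry| entry.map(|e| e.path()))
        .collect::<io::Result<_>>()?;
    entries.sort();
    for path in entries {
        if path.is_file() {
            return Ok(Some(path));
        }
        if path.is_dir() {
            if let Some(found) = first_file(&path)? {
                return Ok(Some(found));
            }
        }
    }
    Ok(None)
}

/// Hypothetical entry point: a plain file has no partition columns; for a
/// directory, infer the columns from the first file found beneath it.
/// Assumption: every file in the directory follows the same layout.
fn detect_partition_cols(location: &Path) -> io::Result<Vec<String>> {
    if fs::metadata(location)?.is_file() {
        return Ok(Vec::new());
    }
    Ok(match first_file(location)? {
        Some(file) => partition_cols_from_path(location, &file),
        None => Vec::new(),
    })
}
```

The result of `detect_partition_cols` could then be used to fill in table_partition_cols instead of leaving it empty. Inspecting only the first file keeps the sketch short; validating that all files share the same partition layout is left out.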