-
Notifications
You must be signed in to change notification settings - Fork 166
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PyIceberg Near-Term Roadmap #736
Comments
@kevinjqliu @Fokko Where would something like the Iceberg Spark |
Sorry for the late reply, I was touching grass.
Thanks for bringing this up @corleyma 🙌 Some related work is being done in #533 and I think PyIceberg should definitely support something like that.
I've added it to the overview. Once the partial deletes + partitioned writes are in, this is supported automatically. We might want to have some community discussion on the API once those two PRs land. |
@Fokko can we add issues for creating tests and documentation for the new features of 0.7.0 as good first issues? |
@tusharchou: Whenever you create a new feature, you need to add the unit & integration test and make the necessary changes in mkdocs as a part of that PR, but if you feel like there are some missing parts, please feel free to raise an improvement/issue and we can discuss that in the python syncup. |
It looks BigLake metastore is going to be replaced with BigQuery metastore. Is the version 0.8.0 roadmap still up-to-date? |
@jaehyeon-kim That is correct. BigQuery Metastore is the replacement for BigLake Metastore. I recommend adjusting the roadmap to skip BigLake metastore and add support for BigQuery Metastore. This PR to the Iceberg Java libraries should be good reference. |
Thanks for the context @anoopj. @jaehyeon-kim looks like #651 is a feature request. There's currently no committed date to implement it, I'll readjust the roadmap to reflect that. |
Feature Request / Improvement
PyIceberg 0.7.0
The main objective of 0.7.0 is to have partitioned writes (non-exhaustive list :)
WriteTask
s #444table_exists
on catalog: check if table exist #406 Addtable_exists
method to the Catalog #507, fixed in Addtable_exists
method to Catalog #512metadata_log_entries
metadata table #594): Metadata Log Entries metadata table #667RollingManifestWriter
#650PyIceberg 0.8.0
PyIceberg 1.0.0
Long-term goals:
DataFrame.write_iceberg
pola-rs/polars#15018)The text was updated successfully, but these errors were encountered: