[feat] Ability to read table using version-hint.txt
#763
Comments
We discussed this issue in the monthly sync and agreed that this is a useful feature. We'll first implement the read side in I will raise this issue with the Java Iceberg implementation and see if there's support also to include this as part of the Iceberg spec. |
@lamb-russell duckdb_iceberg can read the "metadata json file" directly. |
It would be great if duckdb_iceberg could support reading directly from the catalog. |
It is quite ironic: it seems the only Iceberg vendor that generated hint.text was Snowflake! Go figure. Edit: no more :( Snowflake stopped producing hint.text :( |
I think it is fine to add support for reading the |
version-hint.txt
version-hint.txt
@Fokko is this issue still open for working on? For context, we had to build a PyIceberg-based Hadoop Catalog with a subset of features for backwards compatibility when moving Bodo from Iceberg-Java to PyIceberg. See https://github.com/bodo-ai/Bodo/blob/main/bodo/io/iceberg/catalog/dir.py. It would be nice to move at least the read parts to the main repo |
FWIW, I just gave up and am using DuckDB to read Iceberg tables; PyIceberg is clearly not interested in this scenario. |
@srilman Yes, I still think it would be valuable |
I submitted a small PR to allow using |
Feature Request / Improvement
Although not in the official spec, version-hint.txt can be useful to read an Iceberg table without a catalog. This is useful when considering an Iceberg table as a collection of files (metadata and data files) in a "directory" (S3 path). This can also be useful when ingesting Iceberg tables without a catalog. An Iceberg table can thus be "packaged" as a directory.
Example Usecase
When reading, version-hint.txt determines the metadata JSON, usually provided by querying the catalog. When writing, version-hint.txt is committed with the atomic update to the catalog. Additionally, StaticTable can use version-hint.txt to create an Iceberg table from a path.
Relevant Issues:
cc @djouallah
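To make the read path described above concrete, here is a minimal sketch of resolving the metadata JSON from a version hint. It is not the PyIceberg implementation. The function name `resolve_metadata_path` is hypothetical, and the sketch assumes a local table directory with a `metadata/` subfolder, a hint file containing a bare integer version, and metadata files named `v<N>.metadata.json`. None of these conventions are confirmed by this issue, so adjust them to match how your tables were written.

```python
from pathlib import Path


def resolve_metadata_path(table_dir: str, hint_name: str = "version-hint.txt") -> Path:
    """Return the metadata JSON file that the version hint points to.

    Hypothetical helper: the layout below is an assumption, not part of
    the Iceberg spec.
    """
    metadata_dir = Path(table_dir) / "metadata"

    # Assumption: the hint file holds a bare integer such as "3".
    hint = (metadata_dir / hint_name).read_text().strip()
    version = int(hint)

    # Assumption: the matching metadata file is named v<N>.metadata.json.
    candidate = metadata_dir / f"v{version}.metadata.json"
    if not candidate.exists():
        raise FileNotFoundError(
            f"{candidate} not found for version hint {hint!r}"
        )
    return candidate
```

The returned path could then be handed to PyIceberg's `StaticTable.from_metadata`, which accepts a metadata file location. For object stores such as S3, the file reads would need to go through a suitable file IO layer rather than `pathlib`.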