Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Catalog using S3 List and Hive partitioning #11

Open
rdettai opened this issue Jan 22, 2021 · 1 comment
Open

Catalog using S3 List and Hive partitioning #11

rdettai opened this issue Jan 22, 2021 · 1 comment
Labels
enhancement New feature or request

Comments

@rdettai
Copy link
Contributor

rdettai commented Jan 22, 2021

It would be valuable to make it possible to dynamically create a partitioned catalog from the S3 file structure.

  • Create a new TableProvider that uses S3 List and the folder structure to create a file table
  • Implement the SplittableTable that using this provider
  • Cache this catalog to increase speed on subsequent calls
  • Add S3 List access to the Fuse
@rdettai rdettai added the enhancement New feature or request label Jan 22, 2021
@rdettai
Copy link
Contributor Author

rdettai commented Jun 30, 2021

Buzz now support DeltaLake as source Catalog thanks to #16. This feature might still be relevant to make Buzz more flexible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant