Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrate with Kartothek #30

Open
dharhas opened this issue Feb 27, 2020 · 1 comment
Open

Integrate with Kartothek #30

dharhas opened this issue Feb 27, 2020 · 1 comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Milestone

Comments

@dharhas
Copy link
Member

dharhas commented Feb 27, 2020

Is your feature request related to a problem? Please describe.

SpatialPandas helps spatially sort data but we are seeing the need for higher level arbitrary indexing. Two example use cases:

  • Geospatial. We have spatially sorted daily GPS data for the US for multiple days. Getting a small region for a 60-90 day process can get bogged down by the need to read the 60-90 multiple metadata files and construct the task graph.

  • Astronomy. We have spatial data for multiple filters (HSC-Y, HSC-G etc). Again we would have to read multiple metadata files.

Describe the solution you'd like

The above could be fixed by building higher level indexes. I think we can benefit from integrating with kartothek. It enables an O(1) index and creates the necessary task graphs for reading just the partitions required. It could also be used to store the extra metadata spatialpandas currently stores in its own format (if I'm understanding spatialpandas correctly)

I'm at the Dask Dev Conference with some of the Kartothek devs and based on conversations with fjetter this integration should be possible.

@jbednar jbednar added this to the Wishlist milestone Jul 16, 2020
@jbednar jbednar added the enhancement New feature or request label Jul 16, 2020
@jbednar
Copy link
Member

jbednar commented Jul 16, 2020

Sounds cool to me!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants