-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CJIO fails on bigger Dutch OpenData files #54
Comments
cjio works for me with one such file, but I have 32GB of RAM... But the main issue is that the files distributed are indeed gigantic. I wish Kadaster could split the tiles into 4 or even 16 sub-tiles. We have told them, but if more do the same it could help. |
While doing some work for QGIS some months ago, I found that for GeoJSON there are some 'streaming'-variants: see This makes it possible (at least for GeoJSON) to do some sort of streaming reading and writing. |
You are right, and I'm fully aware. Worked on this a bit a few months ago, in the context of WFS3 but it would help here too. We developed CityJSON to be simple to process and analyse and manipulate, such large datasets were not our first priority. The proposal for how it would work is there: https://github.com/hugoledoux/cityjson_ogcapi/blob/master/best-practice.md and there's code: We have an MSc student starting on this topic in September, so expect some results by Christmas! |
interesting, @hugoledoux you may want to get in touch with the pygeoapi team. at least I was not aware of this development. there is e.g. support for JSON-LD to link features. streaming would be very interesting. backed by a PG DB this was something the |
Note to self: Briefly tried simdjson as a drop-in replacement for Minor optimizations, like using tuples instead of lists for storing the vertices have a negligible impact. I think that the |
In The Netherlands PDOK makes cityjson files availabe of NL:
https://brt.kadaster.nl/basisvoorziening-3d/
For example the part with my home town haarlem is this one:
https://download.pdok.nl/kadaster/basisvoorziening-3d/v1_0/2018/volledig/25az1.volledig.zip
The ZIP is 553Mb, unzipped it is 2.7Gb(!)
Trying to cut out a small peace (to load in in QGIS with the cityjson plugin):
cjio 25az1.json subset --bbox 104607 490148 104703 490257 save myarea.json
My (rather beefy Linux) laptop (16Gb, 8 threads) kills the process after a lot of swapping...
So my question:
Any hint on how to handle this opendata is appreciated :-)
The text was updated successfully, but these errors were encountered: