-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Simple API for resources #1286
base: master
Are you sure you want to change the base?
Simple API for resources #1286
Conversation
Thank you for taking the time to contribute this pull request @diogok. Please review my latest comment on issue #1249 Specifically please take note of all the functionality offered by the GBIF API as well as the Registered dataset inventory. If you still believe this request is needed and the above services don't satisfy your needs, please create a new issue to raise its visibility. When doing so, please be sure to explain the rationale and use cases for it as well. That way it is more visible and understandable to the wider community and can be voted on. Thanks |
I so much agree with both points of view. And I don't see any incompatibilites between them. GBIF API is my preferred way of accessing IPT published data for most purposes. I am more than happy enough with it, so I am not really rushed about having an alternative direct access through an IPT api. But if somebody (@diogok , @tigreped , @dvdscripter) wants to create a new issue to raise this need as @kbraak suggests, I will left some ideas here and explain the rationale and use cases for it. Feel free to use them: GBIF api certainly fails when you want to provide easy access to IPT datasets which you do still NOT want to be visible at GBIF data portal. This is not uncommon:
OK. All these are things that you could do creating a web service on your own, using whatever technologies you want. But that forces you to know how to do that, or have other people available that can do it for you. Why not take profit of IPT for doing it, not needing to develop anything else to provide your data? And the other way round, sometimes GBIF api also fail providing easy/correct/quick access to some of the IPT provided info which you DO want to be visible. This is not uncommon:
I will update the list if I figure out any other issues. Thanks a lot |
Thanks David, All your comments and ideas have been noted. Below I address a few of your points with some information that is hopefully useful to you. @mdoering is adding all type specimen names from occurrences to the backbone: You should already be able to do an occurrence search on the original scientific name in the upcoming version of GBIF.org: https://demo.gbif.org/occurrence/search Here you can also try doing a free-text search on the original scientific. GBIF exposes the identifications included in the Identification History Extension, e.g. https://demo.gbif.org/occurrence/1211192227 They should be searchable, at least using the free-text search. Please feel free to do more testing and provide additional feedback. I definitely agree GBIF.org should do better at warning users when indexing is off and explaining how indexing happens. Thanks for supporting gbif/portal16#6 that aims to address this. In case it helps, please see this page created by @dnoesgaard that gives an easy to read overview of the data publishing workflow: https://demo.gbif.org/tools/gbif-software When browsing https://demo.gbif.org, please feel free to log issues for anything that is missing (e.g. more fine-detail documentation explaining when GBIF reindexes datasets), or when you notice a feature isn't working properly (e.g. occurrence search filtered by original scientific name). In case you weren't aware, you can submit feedback easily using the message box in top right hand corner of each page. Ultimately we aim to have the correct processes and documentation in place to reduce load on the GBIF Helpdesk, however, when in doubt about something please don't hesitate to write [email protected] with your questions. Thanks again for your help. |
Hi @diogok and @dgasl nice to see this discussion going on. Sadly I already made an in-house solution for this problem at SiBBr. Our proxy API (we didn't change IPT) can generate meta.xml, process publishing on IPT and reports IPT from all patterns. So my problem is solved but I still see this feature as GBIF Community gain. |
Thank you all for the comments. @kbraak The API proposed here indeed overlaps with the "registered dataset inventory" and I was not aware of that at time of writing. The main addition is that it exposes more complete data and metadata, and also exposes resources not registered/published on GBIF (local only resources). My use case was a bit of some of the points raised by @dgasl:
Since our data was already on IPT for documenting, publishing and archival, it would be great to also use IPT as an integration tool, combined with the ability to download file from inside a resource's DwC-A; |
Hi @diogok Does all this conversation mean that you have implemented a local API for your IPT server, but gbif did not include it in their official version yet? Thanks a lot in advance |
Hello all,
Issue #1249 got my attention these days, and, while I agree that index and provide and an API upon the data it itself is out of scope for IPT, I beleave that a proper way to consume resource information (as in resources links, files, metadata and such) as a webservice would have value and ease the development of tools on top of IPT, such as the needed harversters and public interfaces.
This is mostly a working idea to discuss and maybe build upon.
This includes two endpoints: List of resources and resource details, with proper linking and data exposure.
Gist of how the data is exposed.