Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

questions about case & null values in occurrence api response facets #133

Open
abubelinha opened this issue Jun 14, 2024 · 0 comments
Open

Comments

@abubelinha
Copy link

abubelinha commented Jun 14, 2024

Hello. I am testing different occurrence api calls and I noticed some issues which I think might be improved.
Example query:
https://api.gbif.org/v1/occurrence/search?limit=0&offset=0&facet=country&facet=acceptedTaxonKey&facet=acceptedScientificName&facet=scientificName&facet=verbatimScientificName&basisOfRecord=preserved_specimen&scientificName=Vandenboschia%20speciosa

  • All facets are being returned in lowercase. I believe this is necessary for better grouping text-free DwC fields like stateProvince , recordedBy or verbatimScientificName. But what about scientificName? I understand this is a standardized field coming from backbone matched values, so it's a shame having genus and authorship names in lowercase. That makes these values far less useful than they could be.
    Feature request: Would it be possible to keep the case for those scientific names coming from the backbone?
  • I can get the facetted acceptedTaxonKey, but acceptedScientificName facets are not being returned.
    Feature request: can we get the backbone acceptedScientificName faceted in occurrence responses? (also, in its original case).
    If this is technically possible, it would be so useful for producing quick occurrence based accepted species inventories with a single api call (related discourse question).
  • For non-mandatory fields (i.e. country, stateProvince, recordedBy, year) there is no information about empty/null values in facet information (in the above example 25% of the occurrences have NOT a country value, so NULL or '' would be the 2nd most frequent facet value!!).
    Feature request: Is there any way to optionally get count of null values in facets? (without having to sum all the other facets and substract from total results count)

Thanks a lot
@abubelinha

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant