UTF-8 issues in scientificName #7

Closed
peterdesmet opened this issue Dec 22, 2017 · 4 comments

@peterdesmet
Member

We currently have scientificNames in which non-ASCII characters appear as escaped code points (<U+00E1> instead of á), for example:

2772890 134087647 Loncomelos brevistylus (Wolfner ) Dost<U+00E1>l 9ff7d317-609b-4c08-bd86-3bc404b77c42 Loncomelos brevistylum (Wolfner) Dost<U+00E1>l Plantae SYNONYM 2772885 Ornithogalum pyramidale L.

As you can see, the problem appears in both checklist_scientificName and backbone_scientificName.

So it must be related to how the data are read from the API (because of either our own code or rgbif).

@peterdesmet
Member Author

Double checked: the name is incorrect in checklist_taxa_info[[1]]$data$scientificName[756], so definitely related to how the data are read.

@peterdesmet
Member Author

rgbif::name_usage(key = 134087647) also returns the incorrect name.

@peterdesmet
Member Author

Problem is resolved by setting:

Sys.setlocale("LC_CTYPE", "en_US.UTF-8")

This is the same issue as trias-project/alien-plants-belgium#41.
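
For reference, a minimal sketch of applying that setting a bit more defensively. It uses only base R; the variable names old_ctype, new_ctype and is_utf8 are made up for illustration, and whether the locale en_US.UTF-8 exists depends on the machine.

```r
# Remember the current character-type locale so it can be restored later.
old_ctype <- Sys.getlocale("LC_CTYPE")

# Sys.setlocale() returns "" (and warns) when the requested locale
# cannot be set, so check the return value instead of assuming success.
new_ctype <- suppressWarnings(Sys.setlocale("LC_CTYPE", "en_US.UTF-8"))
if (identical(new_ctype, "")) {
  message("Locale en_US.UTF-8 not available; LC_CTYPE left as: ", old_ctype)
}

# l10n_info() reports whether the session is now running in a UTF-8 locale.
is_utf8 <- isTRUE(l10n_info()[["UTF-8"]])
```

If is_utf8 is FALSE after this, the names read through rgbif may still come back with escaped code points. The original setting can be restored with Sys.setlocale("LC_CTYPE", old_ctype).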

peterdesmet added a commit that referenced this issue Dec 22, 2017
@peterdesmet
Member Author

Fixed this by setting the locale for now (967de37), so this issue can be closed. That won't work on Windows, though, which is why I created a separate issue to test and discuss this at #8.
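
Where the locale cannot be changed, one possible platform-independent workaround is to convert the <U+XXXX> escapes back into characters after the data are read. This is only a sketch and an assumption on my part: it is not the fix applied in 967de37, and unescape_unicode is a hypothetical helper name, not an rgbif function.

```r
# Replace escapes of the form <U+00E1> with the character they encode.
# Hypothetical helper for illustration; uses base R only.
unescape_unicode <- function(x) {
  # Locate every <U+XXXX> token (4 to 6 hex digits) in each string.
  m <- gregexpr("<U\\+[0-9A-Fa-f]{4,6}>", x)
  regmatches(x, m) <- lapply(regmatches(x, m), function(tokens) {
    # Keep only the hex digits and parse them as base-16 integers.
    hex <- sub("^<U\\+([0-9A-Fa-f]+)>$", "\\1", tokens)
    codes <- strtoi(hex, base = 16L)
    # Turn each code point into a one-character UTF-8 string.
    vapply(codes, intToUtf8, character(1))
  })
  x
}

fixed <- unescape_unicode("Loncomelos brevistylum (Wolfner) Dost<U+00E1>l")
```

Strings without escapes pass through unchanged, so the helper could be applied to a whole scientificName column. Missing values (NA) are not handled here and would need a separate check.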
