-
Notifications
You must be signed in to change notification settings - Fork 75
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add country classifications #158
Comments
Hi,
Just one question: a lot of these data seems to be based on UN numeric and/or ISO2 - these are already included. Please make sure to not accidentally add entries which are already in there with another name. If you rather need synonyms you can define them in "_validate_input_para" in the main file. Again, please add tests if you do |
Thanks a lot. Indeed many of the codes are based on existing classifications. E.g. "unido_indstat" and "baci" use ISOnumeric such as "051" for Armenia.
Another example is "eurostat" which is based on multiple in coco existing classifications:
Other classifications such as "prodcom", which uses the codes from Geonomenclature (GEONOM) (I guess another name for "prodcom" is more appropriate) or "hybridexiobase4" would be new classifications for coco. |
Regarding EU, there this EU27 which is the "official" name for the new one. We have a section about that in the readme:
Generally, I would like to avoid just adding columns by data provider if they actually explicitly saying they are using one of the exiting ones. UN will probably use UN numeric in most cases (there is just a question of comparing with int or str). Hybridexio4 definetly make sense. Subregions are tricky. It is a bit more complicated then just adding a row for the subregion. The regular expression probably stop to work or get exponentially more complicated. Also, the linking of subregions to countries is not trivial (disputed areas, different classifications across countries, etc). I think there we would very much push the limit of what is possbile with a simple table. This region/subregion seems to best be handled in some kind of graph database? I would guess somethign like this must exist already. |
Alright, I created a PR #159 Regarding the subregions issue, from our point of view, it can be postponed. So the PR does not address it. |
Hi @konstantinstadler ,
we use coco already in our bonsai project. :-)
Since we use data from several data providers with their own country codes, we would like to extend the country_data.tsv by the following columns (classifications):
The mappings to ISO2 are already in another format in one of our repos.
If you agree, we would open a branch to implement these classifications.
The text was updated successfully, but these errors were encountered: