This repository contains the functionality to standardize the data of the List of Invasive Alien Species of Union concern to a Darwin Core Archive that can be harvested by GBIF.
source data → Darwin Core mapping script → generated Darwin Core files
The source data are manually created from:
- 2022 consolidated version pdf: scientific names with authors, synonym names (in parenthesis) with authors
- spreadsheets containing common names: first version (January 2018) and later additions (April 2020 and January 2022)
The repository structure is based on Cookiecutter Data Science and the Checklist recipe. Files and directories indicated with GENERATED
should not be edited manually.
├── README.md : Description of this repository
├── LICENSE : Repository license
├── union-list.Rproj : RStudio project file
├── .gitignore : Files and directories to be ignored by git
│
├── data
│ ├── raw : Source data, input for mapping script
│ └── processed : Darwin Core output of mapping script GENERATED
│
└── src
└── dwc_mapping.Rmd : Darwin Core mapping script, core functionality of this repository
- Clone this repository to your computer
- Open the RStudio project file
- Open the
dwc_mapping.Rmd
R Markdown file in RStudio - Install any required packages
- Click
Run > Run All
to generate the processed data
MIT License for the code and documentation in this repository. The included data is released under another license.