Skip to content
/ bdc Public
forked from brunobrr/bdc

Check out the vignettes with detailed documentation on each step of the BDC workflow

License

Notifications You must be signed in to change notification settings

sjevelazco/bdc

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BDC

A comprehensive and straightforward workflow for standardizing, integrating, and cleaning biodiversity data

R-CMD-check

Overview

Handle biodiversity data from several different sources is not an easy task. Here we present the Biodiversity Data Cleaning (BDC) workflow, an automated workflow to address quality issues and improve datas’ fitness-for-use. The workflow harmonizes and integrates data from different sources following common standards and protocols and implements various tests and tools to flag, document, clean, and correct the taxonomic, spatial, and temporal information of biodiversity data.

The workflow is composed of five core steps:

  1. Standardization and integration of different datasets;
  2. Pre-filter: flagging and removal of invalid or non-interpretable information, followed by data amendments (e.g., correct transposed coordinates and standardize country names);
  3. Taxonomy: cleaning, parsing, and standardization of scientific names against multiple taxonomic references. The workflow corrects spelling errors and converts nomenclatural synonyms to currently accepted names;
  4. Space: flagging of erroneous, suspicious, and low-precision geographic coordinates;
  5. Time: flagging and, whenever possible, correction of inconsistent collection date.

Aim to facilitate the documentation, visualization, and interpretation of results of data quality tests, in all steps, several files documenting the workflow results are saved automatically in a folder named “Output”. These files include i) records needing further inspection, ii) databases containing the results of each step, iii) figures, and iv) data-quality reports documenting the results.

Installation

You can install the released version of “BDC” from github with:

if (!require("remotes")) install.packages("remotes")
if (!require("bdc")) remotes::install_github("brunobrr/bdc")

Package website

See BDC package website (https://brunobrr.github.io/bdc/) for detailed explanation on each step of the workflow.

Getting help

If you encounter a clear bug, please file an issue here. For questions or suggestion, please send us a email ([email protected]).

About

Check out the vignettes with detailed documentation on each step of the BDC workflow

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 99.0%
  • Makefile 1.0%