Skip to content

Dump converter that downloads and parses dumps for external validation and exports them as csv.

Notifications You must be signed in to change notification settings

WikidataQuality/DumpConverter

Repository files navigation

DumpConverter Build Status Coverage Status Scrutinizer Code Quality

Downloads and parses dumps of databases for external validation in Wikidata using the WikibaseQualityExternalValidation extension.

Installation

DumpConverter requires libxml2 and libxslt to be installed. To install the required development packages of these dependencies on Linux systems, use your distribution specific installation tool, e.g. apt-get on Debian/Ubuntu.

sudo apt-get install libxml2-dev libxslt-dev python-dev

To install DumpConverter, just run the setup script.

git clone https://github.com/WikidataQuality/DumpConverter.git
sudo pip install -e DumpConverter

Usage

python dumpconverter.py [options]

Options

  • --list-databases list all available databases, that can be imported and exit
  • -d / --database DATABASE key of a specific database that should be imported
  • --external-values-file EXTERNAL_VALUES_FILE CSV output file for data values of dumps. - default: external_values.csv
  • --dump-information-file DUMP_INFORMATION_FILE CSV output file for meta informations of dumps. - default: dump_information.csv
  • -q / --quiet suppress output

About

Dump converter that downloads and parses dumps for external validation and exports them as csv.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages