Skip to content
This repository has been archived by the owner on Jan 31, 2024. It is now read-only.

ElixirTeSS/tess-bioschemas-scraper

Repository files navigation

TeSS Bioschemas Scraper

The TeSS Bioschemas Scraper is a web scraper that scrapes content providers for Bioschemas markup and saves it using the TeSS API.

The scraper makes use of the Comunica Framework, a framework for building semantic web applications.

Prerequisites

The only prerequisite is Node.js.

Setup

Install the dependencies

npm install

Update Configuration

For development, you need to update the config/development.yml and for production, you need to update the config/production.yml.

This fille needs updating with the URL and API key for the TeSS installation.

Run

Once you have install the dependencies, you can run the scraper with the following command:

npm run start

Debugging

All logs are written to the logs/ directory. Each day a new log file is created.

testing

run npm run test

About

Bioschemas Scraper for TeSS

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published