etl-biodata

ETL Water Quality Data from the BioData System

These scripts are run by the OWI Jenkins Job Runners under the job name WQP_BIODATA_ETL. They follow the general OWI ETL pattern, using Ant to control the execution of PL/SQL scripts.

The basic flow is:

1. Copy data from the BioData Retrieval system into the biodata schema of the nolog database. (copyFromDW.sql)
2. Drop the referential integrity constraints on the biodata swap tables of the wqp_core schema. (dropRI.sql)
3. Drop the indexes on the biodata station swap table, populate with transformed data, and rebuild the indexes. (transformStation.sql)
4. Drop the indexes on the biodata activity swap table, populate with transformed data, and rebuild the indexes. (transformActivity.sql)
5. Drop the indexes on the biodata result swap table, populate with transformed data, and rebuild the indexes. (transformResult.sql)
6. Drop the indexes on the biodata summary swap tables, populate with transformed data, and rebuild the indexes. (createSummaries.sql)
7. Drop the indexes on the biodata code lookup swap tables, populate with transformed data, and rebuild the indexes. (createCodes.sql)
   Note: Several code lookup values depend on the WQP_NWIS_ETL job having correctly collected data from natprod.
8. Add back the referential integrity constraints on the biodata swap tables of the wqp_core schema. (addRI.sql)
9. Analyze the biodata swap tables of the wqp_core schema. (analyze.sql)
10. Validate that row counts and changes in row counts are within the tolerated values. (validate.sql)
11. Install the new data using partition exchanges. (install.sql)
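
The ordering of the steps above, and the role of validation as a gate before installation, can be sketched in Python. This is only an illustration: the real job is driven by ant and PL/SQL, not Python. The script names are taken from the step list; the names `ETL_STEPS`, `within_tolerance`, and `plan_steps`, the parameters `min_rows` and `max_drop_fraction`, and the tolerance rule itself are hypothetical and are not taken from validate.sql.

```python
# Illustrative sketch only: the real job is driven by ant and PL/SQL.
# Script names come from the step list; everything else is hypothetical.

ETL_STEPS = [
    "copyFromDW.sql",         # copy source data into the biodata schema
    "dropRI.sql",             # drop referential integrity on swap tables
    "transformStation.sql",
    "transformActivity.sql",
    "transformResult.sql",
    "createSummaries.sql",
    "createCodes.sql",
    "addRI.sql",              # restore referential integrity
    "analyze.sql",
    "validate.sql",           # must pass before install
    "install.sql",            # partition exchange into live tables
]


def within_tolerance(old_count, new_count, min_rows, max_drop_fraction):
    """Return True if the new row count is acceptable.

    Hypothetical rule: the new table must hold at least `min_rows` rows and
    must not have shrunk by more than `max_drop_fraction` of the old count.
    """
    if new_count < min_rows:
        return False
    if old_count > 0:
        drop = (old_count - new_count) / old_count
        if drop > max_drop_fraction:
            return False
    return True


def plan_steps(validation_passed):
    """Return the scripts that would run; install is skipped if validation fails."""
    if validation_passed:
        return list(ETL_STEPS)
    return [step for step in ETL_STEPS if step != "install.sql"]
```

The point of the sketch is that the swap tables are fully rebuilt and checked before anything is exchanged into the live tables, so a failed validation leaves the previously installed data untouched.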

The translation of data is specific to this repository. The heavy lifting (indexing, RI, partition exchanges, etc.) is done using common packages in the wqp_core schema. These are defined in the schema-wqp-core repository.

