Skip to content

sairamdgr8/Schema_building_project_Raw_layer_validation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Schema_building_project_Raw_layer_validation

Raw_layer_validation

This is a mini Pyspark project maintaining the integrity of the Metadata.

This code can be used further to extend ETL functionality,other dependency checks like using it for database connections etc.

  1. it checks the number of columns ,column name and column datatype are matching with the source schema to metaschema.
  2. if column count dosent matches then the code will fail to write the job to the target location.

Note:here MetaSchema is a json file which consists of column name and and its dataype defined by you inorder to validate with the respective source file schema

About

Raw_layer_validation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published