Description
What would you like to discuss?
As per #826, since we won't rely on frictionless-py
's validate()
function, we'll need to set up our own. As with good design, keeping the functions small and targeted is a good aim, so here's some ideas for checks to have. Add more if you think of any.
This includes both verifying and validating checks. But doesn't mean we will implement them right away.
Against data package itself
- check that (our and Frictionless) required fields are filled in
- check that fields are the correct type (e.g. str vs int)
Against data resources
- check that (our and Frictionless) required fields are filled in
- check that number of columns are equal between data file and properties
- check that number of rows are the same (between only Parquet file and properties, since raw files will not always have the same)
- check that data types of columns are identical between data and properties
- check that data in column match the constraints, if the field is filled in
Metadata
Metadata
Assignees
Labels
No labels
Type
Projects
Status