Skip to content

[discussion]: Brainstorming for the different check functions to create #834

Closed
@lwjohnst86

Description

@lwjohnst86

What would you like to discuss?

As per #826, since we won't rely on frictionless-py's validate() function, we'll need to set up our own. As with good design, keeping the functions small and targeted is a good aim, so here's some ideas for checks to have. Add more if you think of any.

This includes both verifying and validating checks. But doesn't mean we will implement them right away.

Against data package itself

  • check that (our and Frictionless) required fields are filled in
  • check that fields are the correct type (e.g. str vs int)

Against data resources

  • check that (our and Frictionless) required fields are filled in
  • check that number of columns are equal between data file and properties
  • check that number of rows are the same (between only Parquet file and properties, since raw files will not always have the same)
  • check that data types of columns are identical between data and properties
  • check that data in column match the constraints, if the field is filled in

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions