Skip to content

add ability to generate basic statistics for the individual datasets #217

Open
@ymahlich

Description

@ymahlich

Create a suite of scripts (maybe as part of the coderdata.utils) that can generate basic statistics on the datasets, specifically in regards to metrics that are vital to balancing train/test/validate splits.

Should output basic info as tables and plots.

Metadata

Metadata

Assignees

Labels

documentationImprovements or additions to documentationenhancementNew feature or request

Type

No type

Projects

Status

In progress

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions