Measure categorisation quality #10841

Open
4 tasks
aleene opened this issue Sep 28, 2024 · 0 comments
Labels
categories 🧽 Data quality https://wiki.openfoodfacts.org/Quality

aleene commented Sep 28, 2024

Description

At the moment there are very few checks on categorisation quality. A user can assign whatever category they see fit, and OFF has no way of checking this. Small steps have been made towards monitoring categorisation (olive oils, for example). More can likely be done, and we can define it here.

Acceptance criteria

The end state should be that every category has its own monitoring parameters. Ideally the monitoring happens live, so that a user sees a warning immediately when they enter the wrong category.

What would a demo look like

Error messages such as those used now for nutritional quality checks, similar to the ones for single ingredient checks.

Notes

The parameters for these checks can be either encoded in the taxonomy or calculated.
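To illustrate the "calculated" option, here is a minimal Python sketch of how bounds for one nutrient might be derived from products already in a category. The function name `envelope_from_values`, the choice of the 5th and 95th percentiles, and the sample values are all assumptions for illustration, not something OFF does today.

```python
from statistics import quantiles


def envelope_from_values(values, low_pct=5, high_pct=95):
    """Derive (low, high) bounds from values already recorded for a category."""
    if len(values) < 2:
        raise ValueError("need at least two values to estimate an envelope")
    # quantiles(..., n=100) returns the 99 cut points between percentiles,
    # so the p-th percentile sits at index p - 1.
    cuts = quantiles(values, n=100, method="inclusive")
    return cuts[low_pct - 1], cuts[high_pct - 1]


# Made-up fat values (g per 100 g) for products in one hypothetical category.
fat_values = [91.0, 91.5, 92.0, 91.8, 100.0, 91.6, 12.0, 91.9]
low, high = envelope_from_values(fat_values)
print(f"fat_100g envelope: {low:.1f} to {high:.1f}")
```

Percentile bounds depend on the data already being mostly correct, so for small or noisy categories hand-set values in the taxonomy may be the safer choice.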

Tasks

  • [x] Single ingredient checks - for categories that can only have ONE ingredient.
  • Multiple ingredient checks - for categories whose products contain a fixed set of two or more ingredients, and exactly those ingredients.
  • Minimally required ingredients - for categories whose products should contain at least these (one or more) ingredients. For instance, asparagus soups should contain at least the ingredient asparagus, and canned asparagus should contain the ingredients asparagus AND water.
  • Excluded ingredients - for categories defined by products with multiple ingredients, where some ingredients are not allowed because they define another category. This could be used to catch common categorisation errors.
  • Nutritional envelopes - for each category an envelope of acceptable nutritional values can be defined. If a product's values fall outside this envelope, a warning should be triggered.
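The checks listed above could share one set of per-category parameters. The Python sketch below is only a rough illustration: `CategoryRule`, `check_category`, the ingredient ids, the excluded `en:cream` entry and the fat bounds are hypothetical, and the real server code and taxonomy properties may look quite different.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class CategoryRule:
    """Hypothetical monitoring parameters for one category."""
    exact_ingredients: Optional[frozenset] = None   # single / multiple ingredient check
    required_ingredients: frozenset = frozenset()   # minimally required ingredients
    excluded_ingredients: frozenset = frozenset()   # ingredients that point to another category
    nutrient_envelopes: dict = field(default_factory=dict)  # nutrient -> (low, high)


def check_category(ingredients, nutriments, rule):
    """Return a list of warning messages for a product in this category."""
    found = set(ingredients)
    warnings = []

    if rule.exact_ingredients is not None and found != rule.exact_ingredients:
        expected = ", ".join(sorted(rule.exact_ingredients))
        warnings.append(f"ingredients differ from the expected set: {expected}")

    for missing in sorted(rule.required_ingredients - found):
        warnings.append(f"missing required ingredient: {missing}")

    for unexpected in sorted(rule.excluded_ingredients & found):
        warnings.append(f"excluded ingredient present: {unexpected}")

    for nutrient, (low, high) in rule.nutrient_envelopes.items():
        value = nutriments.get(nutrient)
        # A missing value is skipped rather than treated as an error.
        if value is not None and not (low <= value <= high):
            warnings.append(f"{nutrient}={value} outside envelope [{low}, {high}]")

    return warnings


# Illustrative rule for canned asparagus: asparagus AND water are required.
canned_asparagus = CategoryRule(
    required_ingredients=frozenset({"en:asparagus", "en:water"}),
    excluded_ingredients=frozenset({"en:cream"}),   # made-up exclusion
    nutrient_envelopes={"fat_100g": (0.0, 1.0)},    # made-up bounds
)

for message in check_category(["en:asparagus", "en:salt"], {"fat_100g": 5.0}, canned_asparagus):
    print(message)
```

Returning plain messages keeps the sketch close to the error-message demo described above; a live check would run the same function whenever the category or ingredient list changes.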
github-project-automation bot moved this to "To discuss and validate" in 🍊 Open Food Facts Server issues, Sep 28, 2024
teolemon added the labels categories and 🧽 Data quality (https://wiki.openfoodfacts.org/Quality), Sep 30, 2024
teolemon changed the title from "Categorisation quality" to "Measure categorisation quality", Oct 12, 2024
Projects
Status: To discuss and validate
Status: To do