Manual labeling as heuristic #111

Open
hlibbabii opened this issue May 7, 2021 · 0 comments
Labels
enhancement New feature or request p1-important

Comments

@hlibbabii
Member

hlibbabii commented May 7, 2021

Representation of manual labels in BOHR:

Manual labels for a dataset should be stored in a separate CSV file with the following fields (columns):

  • datapoint id
  • label
  • author
  • note (optional field, ignored by BOHR but potentially useful for annotators)
  • certainty (on a scale of 1-5, how sure the annotator is)
  • showcase (1 if the data point is an interesting case to discuss or a good example to include in the paper)

The same author may have multiple rows for the same data point when more than one label needs to be assigned; the certainty can differ from label to label.
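A minimal sketch of what such a file could look like and how it might be read. The column names (`datapoint_id`, `label`, ...), the label values, the `ManualLabel` class and the `read_manual_labels` function are assumptions made for illustration; none of them are part of BOHR.

```python
import csv
import io
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ManualLabel:
    # Hypothetical record type mirroring the proposed CSV columns.
    datapoint_id: str
    label: str
    author: str
    note: str
    certainty: int
    showcase: bool


def read_manual_labels(csv_text: str) -> List[ManualLabel]:
    """Parse manual labels from CSV text, validating the 1-5 certainty scale."""
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        certainty = int(row["certainty"])
        if not 1 <= certainty <= 5:
            raise ValueError(f"certainty must be in 1-5, got {certainty}")
        rows.append(
            ManualLabel(
                datapoint_id=row["datapoint_id"],
                label=row["label"],
                author=row["author"],
                note=row.get("note") or "",  # optional, may be empty
                certainty=certainty,
                showcase=row.get("showcase") == "1",
            )
        )
    return rows


# Made-up example data: alice assigns two labels with different
# certainties to the same data point.
EXAMPLE_CSV = """datapoint_id,label,author,note,certainty,showcase
c1,BugFix,alice,fixes a crash,5,0
c1,Refactoring,alice,,2,0
c2,Feature,bob,good paper example,4,1
"""

labels = read_manual_labels(EXAMPLE_CSV)
```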

Handling of manual labels by BOHR

BOHR should convert manual labels to heuristics, one for each combination of author and certainty level. This way, labels from different authors and with different certainty levels can receive different weights.
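One possible shape for this conversion, sketched under assumptions: each heuristic is modeled as a plain function from a data point id to a list of labels (or `None` to abstain), and rows are grouped by `(author, certainty)`. The `build_heuristics` function and the row layout are hypothetical and do not reflect BOHR's actual heuristic API.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Optional, Tuple

# Assumed row layout: (datapoint_id, label, author, certainty)
Row = Tuple[str, str, str, int]
Heuristic = Callable[[str], Optional[List[str]]]


def build_heuristics(rows: List[Row]) -> Dict[Tuple[str, int], Heuristic]:
    """Create one lookup-based heuristic per (author, certainty) pair."""
    tables: Dict[Tuple[str, int], Dict[str, List[str]]] = defaultdict(
        lambda: defaultdict(list)
    )
    for datapoint_id, label, author, certainty in rows:
        tables[(author, certainty)][datapoint_id].append(label)

    def make_heuristic(table: Dict[str, List[str]]) -> Heuristic:
        frozen = {key: list(value) for key, value in table.items()}

        def heuristic(datapoint_id: str) -> Optional[List[str]]:
            # Return the manually assigned labels, or None to abstain.
            return frozen.get(datapoint_id)

        return heuristic

    return {key: make_heuristic(table) for key, table in tables.items()}


rows = [
    ("c1", "BugFix", "alice", 5),
    ("c1", "Refactoring", "alice", 2),
    ("c2", "Feature", "bob", 4),
]
heuristics = build_heuristics(rows)
```

Because every `(author, certainty)` pair becomes its own heuristic, a downstream label model can learn a separate weight for, say, alice's certainty-5 labels and alice's certainty-2 labels.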

Further features and improvements

  • once the label hierarchy is updated, provide a mechanism to see which existing manual labels could be replaced with a more fine-grained label from the updated hierarchy
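A rough sketch of how such a check might work, assuming the hierarchy is available as a mapping from each label to its direct child labels. The `find_refinable` function, the hierarchy representation and the label names are all hypothetical.

```python
from typing import Dict, List, Tuple


def find_refinable(
    assigned: List[Tuple[str, str]],
    hierarchy: Dict[str, List[str]],
) -> List[Tuple[str, str, List[str]]]:
    """Return (datapoint_id, label, children) for labels that now have children."""
    result = []
    for datapoint_id, label in assigned:
        children = hierarchy.get(label, [])
        if children:  # the label is no longer a leaf, so it can be refined
            result.append((datapoint_id, label, children))
    return result


# Made-up updated hierarchy: "BugFix" gained two more specific children.
updated_hierarchy = {
    "BugFix": ["MinorBugFix", "MajorBugFix"],
    "Feature": [],
}
assigned_labels = [("c1", "BugFix"), ("c2", "Feature")]

to_review = find_refinable(assigned_labels, updated_hierarchy)
```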

Other considerations

Make use of external tools for manual labeling (Prodigy?: https://prodi.gy/).
Make use of active learning.

@hlibbabii hlibbabii changed the title Label infrastructure improvements Manual labeling infrastructure improvements May 22, 2021
@hlibbabii hlibbabii changed the title Manual labeling infrastructure improvements Manual labeling as heuristic May 24, 2021
@hlibbabii hlibbabii added enhancement New feature or request p1-important labels May 24, 2021