This repository has been archived by the owner on Aug 1, 2024. It is now read-only.

pre-training masked language modeling accuracy #182

Answered by tomsercu
vigneshvalliappan asked this question in Q&A

Hi Vignesh, thanks for your interest!
We haven't really monitored accuracy during training. The PNAS paper you link discusses our main metric, ECE, at length; it is directly related to the objective minimized in MLM training.
Given the pre-trained models, it would be easy to compute accuracy on your own dataset, e.g. by masking out single positions one by one, or by repeatedly masking a random 15% of positions, the way the model was trained.
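
For anyone wanting to try this, here is a minimal, model-agnostic sketch of the two evaluation schemes mentioned above. It is not code from the repository. The `predict` callable, the `MASK` placeholder, and the function names are all hypothetical: `predict` is assumed to take a token list containing mask symbols and return one predicted token per position, so you would wrap your own model's forward pass and argmax to match it. The random scheme here simply replaces every selected position with the mask symbol, which may be a simplification of the exact corruption used in training.

```python
import random
from typing import Callable, List, Sequence

MASK = "<mask>"  # hypothetical mask symbol; use your tokenizer's real one

# Assumed interface: masked tokens in, one predicted token per position out.
Predictor = Callable[[List[str]], List[str]]


def single_position_accuracy(seq: Sequence[str], predict: Predictor) -> float:
    """Mask each position in turn; return the fraction recovered correctly."""
    if not seq:
        raise ValueError("sequence must be non-empty")
    correct = 0
    for i, true_token in enumerate(seq):
        masked = list(seq)
        masked[i] = MASK
        if predict(masked)[i] == true_token:
            correct += 1
    return correct / len(seq)


def random_mask_accuracy(
    seq: Sequence[str],
    predict: Predictor,
    mask_frac: float = 0.15,
    n_rounds: int = 20,
    seed: int = 0,
) -> float:
    """Repeatedly mask a random fraction of positions; score masked ones only."""
    if not seq:
        raise ValueError("sequence must be non-empty")
    rng = random.Random(seed)
    n_mask = max(1, round(mask_frac * len(seq)))  # always mask at least one
    correct = 0
    total = 0
    for _ in range(n_rounds):
        positions = rng.sample(range(len(seq)), n_mask)
        masked = list(seq)
        for i in positions:
            masked[i] = MASK
        predicted = predict(masked)
        correct += sum(predicted[i] == seq[i] for i in positions)
        total += n_mask
    return correct / total


def always_a(tokens: List[str]) -> List[str]:
    """Toy stand-in for a model: guesses 'A' at every masked position."""
    return ["A" if t == MASK else t for t in tokens]


if __name__ == "__main__":
    toy_seq = list("AAGA")
    print(single_position_accuracy(toy_seq, always_a))
    print(random_mask_accuracy(toy_seq, always_a))
```

The single-position scheme costs one forward pass per residue but gives a deterministic number; the random scheme is cheaper per round and closer to the training setup, at the price of some sampling noise, so averaging over several rounds (and sequences) is advisable.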

Answer selected by vigneshvalliappan