pre-training masked language modeling accuracy #182
-
Hi all, thank you for the amazing work. Your paper reports performance on various downstream tasks, but I am curious about the pre-training masked language modeling accuracy. I ask because I'm looking to learn more about peptide sequences using a much smaller model, and would like to know what pre-training accuracy benchmarks to expect. Thank you for your time and consideration. Regards,
Replies: 2 comments
-
Hi Vignesh, thanks for your interest!

We haven't really monitored accuracy during training. The PNAS paper you link discusses our main metric, ECE, at length; it is directly related to the objective minimized during MLM training.

Given the pre-trained models, it would be easy to compute accuracy on your own dataset, e.g. by masking out single positions one at a time, or by repeatedly masking 15% of positions at random, the way the model was trained.
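A minimal sketch of the leave-one-out variant, assuming this is the ESM codebase and its `esm.pretrained` interface; the checkpoint name and the example sequence below are placeholders you would swap for your own:

```python
import math
import torch
import esm

# Load a pre-trained model and its alphabet (placeholder checkpoint;
# any esm.pretrained model should work the same way).
model, alphabet = esm.pretrained.esm1b_t33_650M_UR50S()
batch_converter = alphabet.get_batch_converter()
model.eval()

# Placeholder peptide sequence; substitute your own data here.
data = [("example", "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ")]
_, _, tokens = batch_converter(data)  # adds BOS/EOS around the sequence

n_correct, n_total, nll_sum = 0, 0, 0.0
with torch.no_grad():
    # Mask one position at a time (skip BOS at index 0 and the final EOS).
    for i in range(1, tokens.size(1) - 1):
        masked = tokens.clone()
        masked[0, i] = alphabet.mask_idx
        logits = model(masked)["logits"]  # (1, L, vocab)
        log_probs = torch.log_softmax(logits[0, i], dim=-1)
        true_idx = tokens[0, i].item()
        n_correct += int(log_probs.argmax().item() == true_idx)
        nll_sum += -log_probs[true_idx].item()
        n_total += 1

print(f"masked-token accuracy: {n_correct / n_total:.3f}")
# Exponentiated cross-entropy over the masked tokens (cf. the ECE metric
# discussed in the paper, here computed leave-one-out rather than with
# 15% random masking).
print(f"ECE (exp of mean NLL): {math.exp(nll_sum / n_total):.3f}")
```

The 15% random-masking variant is analogous: instead of looping over positions, repeatedly draw a random 15% subset of indices, set them all to `alphabet.mask_idx` in one forward pass, and score only those positions.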
-
Thank you very much for the information. Regards,