We will fine-tune a pre-trained BERT model on the Microsoft Research Paraphrase Corpus (MRPC). MRPC is a paraphrase identification dataset: given two sentences, a system must decide whether they are paraphrases of each other.
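For a quick look at the data, MRPC ships as tab-separated files with a binary quality label and two sentence columns. The file and column names in the sketch below follow the GLUE distribution of MRPC; adjust them to wherever your copy of the corpus lives:

```python
import pandas as pd

# MRPC is tab-separated; quoting=3 (csv.QUOTE_NONE) avoids misparsing
# sentences that contain quote characters.
df = pd.read_csv("msr_paraphrase_train.txt", sep="\t", quoting=3)

# Columns: Quality (1 = paraphrase, 0 = not), #1 ID, #2 ID, #1 String, #2 String
for _, row in df.head(3).iterrows():
    print(row["Quality"], "|", row["#1 String"], "|", row["#2 String"])
```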
- Clone the repo and install the dependencies by running `pip install -r requirements.txt`
- Run the "Finetuning Bert on MRPC Corpus using FastAI" notebook
We achieve an accuracy of ~0.82 and an F1 score of ~0.87 after only 3 epochs of training.
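For reference, here is a minimal sketch of the core fine-tuning step the notebook performs, written with plain PyTorch and the Hugging Face transformers library rather than the notebook's fastai-based code (the model name, sentences, and hyperparameters below are illustrative assumptions):

```python
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

# MRPC sentence pairs are packed into one sequence: [CLS] s1 [SEP] s2 [SEP]
batch = tokenizer(
    ["He said the food was good.", "Stocks fell sharply on Monday."],
    ["The food was good, he said.", "The weather was pleasant all week."],
    padding=True,
    truncation=True,
    return_tensors="pt",
)
labels = torch.tensor([1, 0])  # 1 = paraphrase, 0 = not a paraphrase

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
model.train()
optimizer.zero_grad()
outputs = model(**batch, labels=labels)  # cross-entropy loss over [CLS] logits
outputs.loss.backward()
optimizer.step()
```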
We can load a BERT model with the masked language modeling head and predict masked words:
import bert_helper

# Load BERT with its masked language modeling head.
bert_token_model = bert_helper.BertMaskedLM()

text = '[CLS] Steve Jobs founded [MASK] . [SEP][CLS] Microsoft makes [MASK] . [SEP]'
preds = bert_token_model.predict_tokens(text)  # predicted tokens for each [MASK]
for p in preds:
    print(p)
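For context, a helper like `BertMaskedLM` can be built directly on a masked-LM head. The sketch below uses Hugging Face transformers' `BertForMaskedLM` (an illustration of the idea, not the repo's actual `bert_helper` implementation) and reports the highest-scoring token at each `[MASK]` position:

```python
import torch
from transformers import BertTokenizer, BertForMaskedLM

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

# The input already carries [CLS]/[SEP] markers, so skip adding special tokens.
text = "[CLS] Steve Jobs founded [MASK] . [SEP]"
inputs = tokenizer(text, return_tensors="pt", add_special_tokens=False)

with torch.no_grad():
    logits = model(**inputs).logits

# Take the highest-scoring vocabulary token at each [MASK] position.
mask_positions = (inputs["input_ids"] == tokenizer.mask_token_id).nonzero()
for _, pos in mask_positions:
    token_id = logits[0, pos].argmax().item()
    print(tokenizer.convert_ids_to_tokens([token_id])[0])
```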
Thanks to Keita Kurita for the excellent starter tutorial: A Tutorial to Fine-Tuning BERT with Fast AI