You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi M. H. Kwon,
Your tokenization script is really helpful.
I trained a bert model with custom corpus using Google's Scripts like create_pretraining_data.py, run_pretraining.py ,extract_features.py etc..as a result I got vocab file, .tfrecord file, .jason file and check point files.
Now how to use those file for the below tasks:
to predict a missing word in a given sentence?
for next sentence prediction
Q and A model
Need your help.
The text was updated successfully, but these errors were encountered:
For 3. Q and A model,
Visit official bert github.
There are instructions about how to do tasks like QA(SQuAD).
Predicting missing words and next sentence prediction are usually used for training.
If you want to predict missing words for practical purpose, you need to make your own code.
You can refer to evaluation part of run_pretraining.py.
It's almost same.
Hi M. H. Kwon,
Your tokenization script is really helpful.
I trained a bert model with custom corpus using Google's Scripts like create_pretraining_data.py, run_pretraining.py ,extract_features.py etc..as a result I got vocab file, .tfrecord file, .jason file and check point files.
Now how to use those file for the below tasks:
Need your help.
The text was updated successfully, but these errors were encountered: