-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fine tune vit5-base model for text summarization #6
Comments
Hi @MinhDang685 , for MLM pretraining we used mesh-tensorflow. The models on HuggingFace are ready for finetuning only. You don't need to add prefix in finetunning. |
Hi @justinphan3110, thanks for your quick reply.
Thanks |
|
Hi @justinphan3110, thanks for your help, I try to generate with the model again and it works now, the output sequences now changes base on the input I notice that you have updated the model config.json file by removing task specific prefixes, is it the cause of the issue (that I miss the "summarization" prefix before the input to indicate I want the model to perform summarization task)? |
@MinhDang685 , You can have a look over the eval scripts with HuggingFace |
thank you @justinphan3110 for pointing that out |
Hello VietAI team,
Thanks for sharing the pretrained models in your research paper. I am interested on fine tuning the VietAI/vit5-base language model for the abstractive summarization task. I have some questions:
Thanks a lot
The text was updated successfully, but these errors were encountered: