You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Using the OCR-VQA model does not always give consistent results when the prompt is left unchanged
What is the most consitent way to use the model as an OCR?
The text was updated successfully, but these errors were encountered:
You can use the fine-tuned text-caps model, train it again on ocr tasks which involves learning to generate text from a given image. I hope this will get you there.
Using the OCR-VQA model does not always give consistent results when the prompt is left unchanged
What is the most consitent way to use the model as an OCR?
The text was updated successfully, but these errors were encountered: