Replies: 1 comment
-
Hi @jidalii 👋, One thing you could try is to binarize the image because I think the models have maybe problems with the blue X.
Another option would be to fine tune the preferred model (for this case a few samples should be enough ~20-40) See: https://mindee.github.io/doctr/using_doctr/custom_models_training.html A labeling tool can be found here: https://github.com/text2knowledge/docTR-Labeler - early phase release (Or take any other tool of your choice) - boxes should be as close as possible the the text (word) Best, |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, I currently use docTR to convert scanned PDF files into CSV files. One section of the sheet contains cross marks. However, I tried different text detection models, and neither can detect cross marks. I wonder whether there are any missing features from docTR that I overlooked for the cross mark detection, or there is an alternative solution for it. Thanks!
Here is the image and screenshot of the detection result:
Here is the code:
Beta Was this translation helpful? Give feedback.
All reactions