-
I've been using get_text to parse text from pdfs. Now I want to extend it by adding support for parsing text from scanned pdfs, pdfs containing text and images both on a single page. For this, I started using get_textpage_ocr. Question |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
If the page contains no images and if you use parameter |
Beta Was this translation helpful? Give feedback.
If the page contains no images and if you use parameter
full=False
then no OCR is being executed. Regular text will then be extracted like normal.