How to remove the image-with-text from the PDF #1393

SurinameClubcard · 2024-09-08T12:48:40Z

Hi,

I'm trying to OCR an old PDF and OCRmyPDF is actually doing a great job.

The next step in my workflow is to use Google Translate to translate the PDF from English to Dutch. The result looks like this:

The image of the text from the original PDF is not removed, which makes sense (how would Google know?).

Is there an OCRmyPDF option to remove the image-with-text that the OCR content was produced from? I do not want to remove all images; the PDF also contains pictures that should be kept.

Regards!
