Skip to content

Fine tune Detection and Recognition models on custom dataset #637

Answered by fg-mindee
hkhanr asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @hkhanr 👋

About the recognition format, I'm not sure I understand your question, but the README does provide examples about the format. But text recognition doesn't have any localization information. Text detection does.

So if you have the OCR annotations (localization + text) of some documents. For the text detection training, only remove the text information. For the text recognition, you will have to crop each localized word.

And then, imagine you have a folder called "images" and put the cropped images in it: crop1.jpg, crop2.jpg, crop3.jpg. Then your labels should be in json format in a "labels.json" next to the "images" folder, and should have this structure:

{"crop1.jpg": "I", "…

Replies: 1 comment 7 replies

Comment options

You must be logged in to vote
7 replies
@hkhanr
Comment options

@fg-mindee
Comment options

@hkhanr
Comment options

@fg-mindee
Comment options

@hkhanr
Comment options

Answer selected by hkhanr
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants