Replies: 1 comment 3 replies
-
Hi @ajkdrag 👋, So based on the font, it's very special 😅 As far as clustering is concerned, I would be open to suggestions, additions and optimizations. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Currently the process of merging word bboxes into lines uses spacing to decide which boxes to merge. It would be really nice if there was a flag/option to do merging based on different algorithms/clustering techniques, i.e. cluster based on font similarity and proximity, or say, cluster based on bbox shape. This is useful in some documents where there are multiple fonts and the merging isn't very precise. Although I understand that this doesn't happen very happen, but I think it would be a nice addition.
Beta Was this translation helpful? Give feedback.
All reactions