Extract tagged tables from pdf using 'StructTreeRoot' #3899
Unanswered
mllife
asked this question in
Looking for help
Replies: 1 comment
-
How make the code parse the tables from the StructTreeRoot and extract json or csv using this information? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
So, I am working with pdfs and now I need to see, if I can parse the tables in pdfs (if tagged)
Reference - https://accessible-pdf.info/en/basics/general/overview-of-the-pdf-tags/#table-elements
Attaching a sample pdf with tagged tables;
FishTables-only.pdf
Beta Was this translation helpful? Give feedback.
All reactions