Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Failed extraction - Class CTTextCharacterProperties is missing. #32

Open
s4zuk3 opened this issue Nov 20, 2024 · 6 comments
Open

Failed extraction - Class CTTextCharacterProperties is missing. #32

s4zuk3 opened this issue Nov 20, 2024 · 6 comments
Labels
bug Something isn't working

Comments

@s4zuk3
Copy link
Contributor

s4zuk3 commented Nov 20, 2024

Hello! I’ve started using the library and came across some errors. This error appears when trying to extract the content from a PDF with default configurations:

Exception in thread "main": org.graalvm.nativeimage.MissingReflectionRegistrationError
org.graalvm.nativeimage.MissingReflectionRegistrationError: The program tried to reflectively instantiate the array class
org.openxmlformats.schemas.drawingml.x2006.main.CTTextCharacterProperties[]

It seems that this class is missing in the native image resources.

Thanks!

@nmammeri nmammeri added the bug Something isn't working label Nov 25, 2024
@nmammeri
Copy link
Contributor

It would be great if you can share the file to add it to our test suite. please let us if you can't share the file so that we can look further on how to fix the issue

@s4zuk3
Copy link
Contributor Author

s4zuk3 commented Nov 25, 2024

CTTextCharacterProperties_issue.pdf

Thanks!

@nmammeri
Copy link
Contributor

Hi Francisco, I actually got it fixed on my local machine. Was waiting for the xml issue to release. But if you need it urgently I can make a minor release

@s4zuk3
Copy link
Contributor Author

s4zuk3 commented Dec 16, 2024

Hi @nmammeri ! Thank you so much for having the fix! Is it possible to do a minor release for this? We want to use Extractous, but that error has been blocking us since then.

Thank you very much!

(Sorry, I had used another account by mistake, I deleted the comment)

@nmammeri
Copy link
Contributor

Hi @s4zuk3 0.3.0 is released with the fix. We also added extracting output as xml. you may well check it out 🎉. thanks for your patience

@nmammeri
Copy link
Contributor

@s4zuk3 this is fixed in v0.3.0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants