-
-
Notifications
You must be signed in to change notification settings - Fork 572
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow extracting Libre Office documents #4001
Comments
Thanks. We already some minimal way to extract this in extractcode with an option for office document. This may not be the extract you expect, instead it treats modern office documents as the zip they really are, and process them as an archive. Could this be what you are looking for? |
As mentioned, this seems to work for Microsoft Office documents only, but not for Libre Office specific formats - at least with the dependencies bundled by the latest SCTK release. Handling them as ZIP files is totally fine for me. |
Hey @stefan6419846 I want to work on it , but as i was goin through the codebase I could'nt get where to implement the logic ,may i know i which dir or file should I implement it?? , I have already went through codebase but I am not getting it. |
The actual maintainers might know better about this, but I assume that this should probably go into https://github.com/aboutcode-org/extractcode |
Thanks @stefan6419846 for letting me know , I'll clone it and start working!! |
Hey @stefan6419846 checkout my PR and do let me know If I can improve it ! , aboutcode-org/extractcode#67 |
Short Description
SCTK should handle Libre Office documents like Excel documents, id est extract the corresponding container as possible.
Possible Labels
Select Category
Describe the Update
SCTK already supports extracting Excel documents. Especially for free software, Libre Office documents are rather common as well, but cannot be extracted at the moment, while the general structure seems to be similar to at least some extent (archive holding XML files).
How This Feature will help you/your organization
Make sure to catch licenses from Libre Office documents, which are used for documentation for example.
Possible Solution/Implementation Details
Add functionality to extract the corresponding containers.
Example/Links if Any
Gnome File Roller supports extracting the individual files of a
.ods
(spreadsheet) file for example.The text was updated successfully, but these errors were encountered: