Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow extracting Libre Office documents #4001

Open
3 of 7 tasks
stefan6419846 opened this issue Nov 28, 2024 · 2 comments
Open
3 of 7 tasks

Allow extracting Libre Office documents #4001

stefan6419846 opened this issue Nov 28, 2024 · 2 comments

Comments

@stefan6419846
Copy link

Short Description

SCTK should handle Libre Office documents like Excel documents, id est extract the corresponding container as possible.

Possible Labels

  • new feature
  • archive extraction

Select Category

  • Enhancement
  • Add License/Copyright
  • Scan Feature
  • Packaging
  • Documentation
  • Expand Support
  • Other

Describe the Update

SCTK already supports extracting Excel documents. Especially for free software, Libre Office documents are rather common as well, but cannot be extracted at the moment, while the general structure seems to be similar to at least some extent (archive holding XML files).

How This Feature will help you/your organization

Make sure to catch licenses from Libre Office documents, which are used for documentation for example.

Possible Solution/Implementation Details

Add functionality to extract the corresponding containers.

Example/Links if Any

Gnome File Roller supports extracting the individual files of a .ods (spreadsheet) file for example.

@pombredanne
Copy link
Member

Thanks. We already some minimal way to extract this in extractcode with an option for office document. This may not be the extract you expect, instead it treats modern office documents as the zip they really are, and process them as an archive. Could this be what you are looking for?

@stefan6419846
Copy link
Author

As mentioned, this seems to work for Microsoft Office documents only, but not for Libre Office specific formats - at least with the dependencies bundled by the latest SCTK release. Handling them as ZIP files is totally fine for me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants