Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Better PDF ingestion/parsing via Docling + pymupdf4llm #419

Merged
merged 3 commits into from
Nov 9, 2024
Merged

Conversation

rmusser01
Copy link
Owner

@rmusser01 rmusser01 commented Nov 9, 2024

Option to select pymupdf, pymupdf4llm and Docling is now available in the Test PDF Ingestion and PDF Ingestion tabs.
Fixed install script for linux, will now properly execute tldw at the end of a succesful installation.
Updated requirements.txt for docling + pymupdf4llm.
Updated install instructions for Linux to use latest cuda version.

@rmusser01 rmusser01 merged commit f3e5f5b into main Nov 9, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant