Better PDF ingestion/parsing via Docling + pymupdf4llm #419

rmusser01 · 2024-11-09T01:36:07Z

Option to select pymupdf, pymupdf4llm and Docling is now available in the Test PDF Ingestion and PDF Ingestion tabs.
Fixed install script for linux, will now properly execute tldw at the end of a succesful installation.
Updated requirements.txt for docling + pymupdf4llm.
Updated install instructions for Linux to use latest cuda version.

rmusser01 added 3 commits November 4, 2024 18:33

Prompts

9b976d7

Fixed Install instructions for Linux + fixed the install script

91da390

Add Docling and pymupdf4llm + started writing a user guide.

b91d63f

rmusser01 merged commit f3e5f5b into main Nov 9, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better PDF ingestion/parsing via Docling + pymupdf4llm #419

Better PDF ingestion/parsing via Docling + pymupdf4llm #419

rmusser01 commented Nov 9, 2024 •

edited

Loading

Better PDF ingestion/parsing via Docling + pymupdf4llm #419

Better PDF ingestion/parsing via Docling + pymupdf4llm #419

Conversation

rmusser01 commented Nov 9, 2024 • edited Loading

rmusser01 commented Nov 9, 2024 •

edited

Loading