[Feature Request] Document toolkits #1408

Wendong-Fan · 2025-01-07T11:51:29Z


The toolkit should be able to process common document types, whether they are given as URLs or local files. It builds on lower-level toolkits such as the video toolkit and the audio toolkit, and it uses GraphRAG to process raw text and answer queries.
It should also support text modification: changing existing text to improve quality, adjust tone, or better suit a specific audience. This is a fundamental task in content creation and editing. (low priority)
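
To make the "URLs or local files" requirement concrete, here is a minimal sketch of how a source could be classified and routed to a lower-level handler. It is only an illustration under stated assumptions: the `HANDLERS` table, the handler names, and the `route` function are hypothetical and are not taken from the existing implementation.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Hypothetical mapping from file extension to a handler name; the real
# toolkit may organize this differently.
HANDLERS = {
    ".pdf": "document",
    ".docx": "document",
    ".txt": "document",
    ".md": "document",
    ".mp3": "audio",
    ".wav": "audio",
    ".mp4": "video",
    ".mov": "video",
}


def is_url(source: str) -> bool:
    """Return True when the source looks like an http(s) URL."""
    parsed = urlparse(source)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def route(source: str) -> tuple[str, str]:
    """Return (location, handler) for a URL or a local path.

    location is "url" or "local". handler is a value from HANDLERS,
    "webpage" for URLs without a known file extension, or "unknown"
    for local paths without a known file extension.
    """
    if is_url(source):
        suffix = PurePosixPath(urlparse(source).path).suffix.lower()
        return "url", HANDLERS.get(suffix, "webpage")
    # Normalize Windows separators so the suffix is found either way.
    suffix = PurePosixPath(source.replace("\\", "/")).suffix.lower()
    return "local", HANDLERS.get(suffix, "unknown")
```

With this sketch, `route("https://example.com/a/report.pdf")` yields `("url", "document")`, while a URL with no recognizable extension falls back to `"webpage"`. Routing by extension alone is a simplification; checking the content type of the response would be more robust for URLs.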

Some of the implementation has already been done by MengKang.


Wendong-Fan · 2025-01-09T17:25:37Z

Aaron617 · 2025-01-10T05:33:00Z

After testing the current document toolkit on GAIA, @Ralph-Zhou and I found several problems:

For longer content, processing essentially becomes a RAG task. However, the current RAG capability still lags somewhat behind agents such as H2O. (current reference repository: https://github.com/shibing624/ChatPDF)
The function extract_webpage_content needs enhancement. It currently may miss some content (for example, tables are not processed well).
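
As an illustration of the table problem, here is a minimal sketch of one way to keep table structure when extracting a page, using only the standard library. This is not the current extract_webpage_content implementation; `TableExtractor` and `tables_to_markdown` are hypothetical names, and the sketch assumes well-formed markup with explicit closing tags. It ignores colspan/rowspan and does not handle nested tables.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect each <table> as a list of rows, each row a list of cell texts."""

    def __init__(self) -> None:
        super().__init__()
        self.tables = []   # list of tables; each table is a list of rows
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def tables_to_markdown(html: str) -> list[str]:
    """Render every table found in the HTML as a Markdown table string."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    rendered = []
    for table in parser.tables:
        if not table:
            continue
        # Pad short rows so every row has the same number of columns.
        width = max(len(row) for row in table)
        rows = [row + [""] * (width - len(row)) for row in table]
        lines = [
            "| " + " | ".join(rows[0]) + " |",
            "| " + " | ".join(["---"] * width) + " |",
        ]
        lines += ["| " + " | ".join(row) + " |" for row in rows[1:]]
        rendered.append("\n".join(lines))
    return rendered
```

Rendering tables as Markdown keeps rows and columns aligned in the text that is later chunked for RAG, instead of flattening cells into one run of words. The first row is treated as the header, which is an assumption that will not hold for every page.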

Wendong-Fan added the New Feature, call for contribution, and P0 (Task with high level priority) labels Jan 7, 2025

Wendong-Fan added this to the Sprint 20 milestone Jan 7, 2025

Wendong-Fan added this to Project Camel Jan 7, 2025

Wendong-Fan assigned willshang76 Jan 7, 2025

Wendong-Fan removed the call for contribution label Jan 7, 2025

Wendong-Fan assigned Aaron617 and AveryYay Jan 9, 2025

Wendong-Fan modified the milestones: Sprint 20, Sprint 21 Jan 9, 2025
