Use query understanding for RAG retrieval #8

bdb-dd · 2023-10-16T11:25:54Z

Description

Updated 26.02.24:
Even after having successfully addressed the retrieval ranking issues we had earlier, there are still many opportunities for improving retrieval for specific kinds of queries. As an example, a user may wish to qualify their search by defining specific filters, such as "updated recently", "sorted by version number" or "only open issues".

The "Query understand" strategy calls for using LLMs to generate the retrieval query itself, based on a combination of knowledge of the underlying search engine, the data schemas involved and potentially some function calling extensions.

Evaluate

Give feedback

Evaluate traditional text indexing tools designed to deliver relevant results from free text queries
Test a specific free text query engine with a generic configuration suitable for our documentation data set
[in-progress] Gather requirements for evaluating and tuning free text query performance
[in-progress] Evaluate results and determine if there is a need to tune the configuration, for example for special handling of multiple languages, content that has been machine translated, recency, metadata
https://github.com/Altinn/digdir-slack-bot/issues/49
Options

Additional Information

No response

bdb-dd · 2023-10-23T21:56:57Z

First end to end test completed. Initial results look very promising.

Will likely require additional testing and content improvements to deal with issues related to certain topics.

Certain documents should probably be included in context regardless of search terms.

bdb-dd · 2023-10-27T11:25:37Z

Sent invitation to a broader group of people who can contribute with a varied set of user queries. Quickly finding examples where the first stage, extract search terms, is not as selective as it could be. A large number of search terms currently results in a smaller result set, sometimes including documents that are highly ranked for no apparent reason.

Have also tested asking GPT 3.5 for feedback on which of the supplied context documents were relevant, with good results. So one option would be to "pin" certain source documents, such that they are always included in the RAG context. The context length has varied significantly from query to query, sometimes exceeding 16K which is our current upper limit.

bdb-dd added the kind/feature-request New feature or request label Oct 16, 2023

bdb-dd changed the title ~~Improve context stuffing for RAG~~ Use query understanding for RAG retrieval Oct 17, 2023

bdb-dd self-assigned this Oct 23, 2023

bdb-dd transferred this issue from another repository Feb 25, 2024

renovate bot mentioned this issue Dec 1, 2024

fix(deps): update npm non-major dependencies #93

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use query understanding for RAG retrieval #8

Use query understanding for RAG retrieval #8

bdb-dd commented Oct 16, 2023 •

edited

Loading

Evaluate

bdb-dd commented Oct 23, 2023

bdb-dd commented Oct 27, 2023

Use query understanding for RAG retrieval #8

Use query understanding for RAG retrieval #8

Comments

bdb-dd commented Oct 16, 2023 • edited Loading

Description

Evaluate

Additional Information

bdb-dd commented Oct 23, 2023

bdb-dd commented Oct 27, 2023

bdb-dd commented Oct 16, 2023 •

edited

Loading