Community report grouping. #841

FatemaD1577 · 2024-08-06T06:51:33Z

Is there an existing issue for this?

I have searched the existing issues
I have checked #657 to validate if my issue is covered by community support

Describe the issue

I am trying to understand implementation of graph rag. I have gone through the documentation available and also through the git repo to understand the global query part. My understanding of the implementation is as follows:

We first create a graph based on the entities and relationships extracted.
Communities are then created by grouping closely related entities
At the next level community reports are created.
Whenever user query comes we shuffle the community reports randomly and group them
An intermediate response is generated for each group and a score is assigned to these groups based on the relevance of the answer to the user query
Responses with 0 score are filtered out and the remaining are passed on to the LLM for final response generation

Below are some of the doubts still not cleared:

Are community reports created at different levels to incrementally cover larger amount of information?
On what basis are the communities reports grouped? Is there any parameter to control number of community reports that will be included in one group?
Only responses with 0 score are filtered out or is there a threshold below which the responses will not be considered for final response generation?
On what basis are we scoring the intermediate response in relevance to user query? Is it using similarity search or any other method to determine the score?

Having an answer to these doubts will help me have more clarity on the process of the global querying part.

Thank you in advance.

Steps to reproduce

No response

GraphRAG Config Used

# Paste your config here

Logs and screenshots

No response

Additional Information

GraphRAG Version:
Operating System:
Python Version:
Related Issues:

Xiyuche · 2024-08-06T09:38:00Z

For the intermediate response score, this might be helpful, from the paper

natoverse · 2024-08-06T19:54:53Z

Moving to Discussions: #849

FatemaD1577 added the triage Default label assignment, indicates new issue needs reviewed by a maintainer label Aug 6, 2024

natoverse closed this as completed Aug 6, 2024

natoverse removed the triage Default label assignment, indicates new issue needs reviewed by a maintainer label Aug 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Community report grouping. #841

Community report grouping. #841

FatemaD1577 commented Aug 6, 2024

Xiyuche commented Aug 6, 2024

natoverse commented Aug 6, 2024

Community report grouping. #841

Community report grouping. #841

Comments

FatemaD1577 commented Aug 6, 2024

Is there an existing issue for this?

Describe the issue

Steps to reproduce

GraphRAG Config Used

Logs and screenshots

Additional Information

Xiyuche commented Aug 6, 2024

natoverse commented Aug 6, 2024