Skip to content

[Bug]: Mismatch between header in community report generation prompt examples and input data (id vs human_readable_id) #860

@ksachdeva

Description

@ksachdeva

Do you need to file an issue?

  • I have searched the existing issues and this bug is not already filed.
  • My model is hosted on OpenAI or Azure. If not, please look at the "model providers" issue and don't file a new one here.
  • I believe this is a legitimate bug, not just a question. If this is a question, please use the Discussions area.

Describe the bug

image

Above is an image showing the portion of community_report extraction prompt.

Entities

id,entity,description


Relationships

id,source,target,description

The context data that is generated ends up having headers that are a bit different

Entities
human_readable_id,title,description


Relationships

human_readable_id,source,target,description

The differences are

For Entities - generate context has human_readable_id instead of id and title instead of entities
For Relationships - generate context has human_readable_id instead of id

The bug can be fixed by modifying the code or making the headers in the prompt compliant with the ones generated by the code.

For code, you would need modifications in these verbs.

This is happening because of

and

Steps to reproduce

You can look into the cache files. I am attaching one here.

example_community_report_cache.txt

Expected Behavior

No response

GraphRAG Config Used

# Paste your config here

Logs and screenshots

No response

Additional Information

  • GraphRAG Version:
  • Operating System:
  • Python Version:
  • Related Issues:

Metadata

Metadata

Assignees

Labels

backlogWe've confirmed some action is needed on this and will plan itbugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions