retriever.get_relevant_documents is broken. tutorial (PART 1-4) #19

ca-mi-lo · 2024-04-30T17:57:21Z

Following the tutorial (PART 1-4), I noticed that the model answered with "not enough information to answer". Looking at "docs" I get this output:
[Document(page_content='Conversatin samples:\n[\n {\n "role": "system",', metadata={'source': 'https://lilianweng.github.io/posts/2023-06-23-agent/'})]
It seems that docs = retriever.get_relevant_documents("What is Task Decomposition?") is not returning a relevant page_content.
If I invoke the chain with the whole context (splits instead of docs), I do get a meaningful answer:
chain.invoke({"context":splits,"question":"What is Task Decomposition?"})

Note: I'm using Google's API, but I would expect that to be irrelevant.
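For anyone debugging the same symptom (the retriever returns a single, unrelated chunk), it may help to look at how similarity ranking behaves. The following is only a minimal sketch in plain Python, not the tutorial's or LangChain's actual code: `rank_chunks`, the toy two-dimensional vectors, and the shortened chunk texts are all hypothetical and made up for illustration. It shows that the same ranking logic returns the relevant chunk when the embeddings separate the texts well, and an irrelevant one when they do not.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank_chunks(query_vec, chunk_vecs, chunk_texts, k=1):
    """Return the k (score, text) pairs most similar to the query vector."""
    scored = zip((cosine(query_vec, v) for v in chunk_vecs), chunk_texts)
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]


# Hypothetical chunk texts, shortened for the example.
chunk_texts = [
    "Task decomposition breaks a task into smaller steps.",
    'Conversatin samples: [ { "role": "system",',
]

# Made-up vectors from an embedding that separates the two chunks well.
good_vecs = [[0.9, 0.1], [0.1, 0.9]]
query_good = [1.0, 0.0]

# Made-up vectors from an embedding that places everything close together.
poor_vecs = [[0.5, 0.5], [0.6, 0.4]]
query_poor = [0.6, 0.4]

for score, text in rank_chunks(query_good, good_vecs, chunk_texts):
    print(f"good embedding: {score:.3f} {text}")
for score, text in rank_chunks(query_poor, poor_vecs, chunk_texts):
    print(f"poor embedding: {score:.3f} {text}")
```

If the retrieved chunk looks unrelated while the chain answers correctly when given all of splits, the embedding model is a likely place to look, since the ranking step itself only compares vectors.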


rcorneanu · 2024-09-04T06:58:53Z

I've had the same issue with Ollama + llama3.1.
After changing the embeddings model to "nomic-embed-text" instead of using the LLM, it worked great.

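from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma
vectorstore = Chroma.from_documents(documents=splits,
                                    embedding=OllamaEmbeddings(model="nomic-embed-text"))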
