
[trace][semantic] attributes for re-ranking #1153

Closed · 4 tasks · Tracked by #1000 ... · mikeldking opened this issue Aug 18, 2023 · 7 comments · Fixed by #1588

mikeldking (Contributor) commented Aug 18, 2023

Use-cases: sometimes the retrieved documents need to be re-ranked before being passed on, for example via:

  • a Cohere re-ranker
  • a GPT-based prompt re-ranker
  • a general cross-encoder-based re-ranker

Attributes to capture (sketched below):

  • model or strategy used to re-rank
  • final re-ordering of documents
  • scores from the model
  • re-ranked documents

This will allow us to compute NDCG for retrieval.
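For concreteness, a rough sketch of what those attributes could look like on a span. The attribute names, values, and document payload shape below are placeholders for discussion, not a finalized convention:

# Hypothetical attribute names, for discussion only -- not a finalized
# OpenInference semantic convention.
reranker_span_attributes = {
    # model or strategy used to re-rank
    "reranker.model_name": "rerank-english-v2.0",
    "reranker.top_k": 2,
    # documents going into the re-ranker, in their original retrieval order
    "reranker.input_documents": [
        {"document.id": "doc-3", "document.score": 0.71},
        {"document.id": "doc-9", "document.score": 0.64},
    ],
    # final re-ordering plus the scores assigned by the re-ranking model
    "reranker.output_documents": [
        {"document.id": "doc-9", "document.score": 0.98},
        {"document.id": "doc-3", "document.score": 0.42},
    ],
}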

axiomofjoy (Contributor) commented:

LlamaIndex supports re-ranking via:

  • generative LLMs
  • Cohere API
  • sbert

These re-rankers are implemented as node post-processors and do not have specialized callback hooks. All post-processors currently run inside the retrieval step, so the payload passed to the on_event_end hook for the RETRIEVE callback event type contains the retrieved documents post-re-ranking. The callback system does not currently have access to the retrieved documents pre-re-ranking, nor to any of the re-ranking data or metadata (e.g., model name or scores).
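A minimal sketch of a custom handler that shows what the callback system exposes at that point, assuming the llama_index 0.8-era callback interface (BaseCallbackHandler, CBEventType, EventPayload); the handler name is mine, and the nodes it prints for RETRIEVE events are already post-re-ranking:

from typing import Any, Dict, List, Optional

from llama_index.callbacks.base import BaseCallbackHandler
from llama_index.callbacks.schema import CBEventType, EventPayload


class RetrieveInspector(BaseCallbackHandler):
    """Prints the nodes attached to RETRIEVE events.

    Node post-processors (including re-rankers) run inside the retrieval
    step, so these nodes are already re-ranked and truncated to top_n.
    """

    def __init__(self) -> None:
        super().__init__(event_starts_to_ignore=[], event_ends_to_ignore=[])

    def on_event_start(
        self,
        event_type: CBEventType,
        payload: Optional[Dict[str, Any]] = None,
        event_id: str = "",
        **kwargs: Any,
    ) -> str:
        return event_id

    def on_event_end(
        self,
        event_type: CBEventType,
        payload: Optional[Dict[str, Any]] = None,
        event_id: str = "",
        **kwargs: Any,
    ) -> None:
        if event_type is CBEventType.RETRIEVE and payload is not None:
            for node_with_score in payload.get(EventPayload.NODES, []):
                print(node_with_score.score, node_with_score.node.node_id)

    def start_trace(self, trace_id: Optional[str] = None) -> None:
        pass

    def end_trace(
        self,
        trace_id: Optional[str] = None,
        trace_map: Optional[Dict[str, List[str]]] = None,
    ) -> None:
        pass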

axiomofjoy (Contributor) commented:

It's unclear to me whether re-ranking deserves its own span (and if so, what its span kind would be), or whether the re-ranking data should instead be attached to the retrieval span via semantic conventions.
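To make the two options concrete, purely illustrative sketches of the resulting trace shapes (the span kind and attribute names are hypothetical):

# Option A: re-ranking gets a dedicated child span under the retrieval span.
option_a = {
    "span_kind": "RETRIEVER",
    "children": [
        {
            "span_kind": "RERANKER",
            "attributes": {"reranker.model_name": "...", "reranker.top_k": 2},
        }
    ],
}

# Option B: re-ranking data is folded into the retrieval span's attributes
# via semantic conventions.
option_b = {
    "span_kind": "RETRIEVER",
    "attributes": {"reranker.model_name": "...", "reranker.top_k": 2},
}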

axiomofjoy (Contributor) commented:

Script to run the LlamaIndex Cohere re-ranker with our callback handler:

import os

from phoenix.experimental.callbacks.llama_index_trace_callback_handler import (
    OpenInferenceTraceCallbackHandler,
)

from llama_index import ServiceContext, SimpleDirectoryReader, VectorStoreIndex
from llama_index.callbacks import CallbackManager
from llama_index.indices.postprocessor.cohere_rerank import CohereRerank
from llama_index.response.pprint_utils import pprint_response

documents = SimpleDirectoryReader(
    "/Users/xandersong/llama_index/docs/examples/data/paul_graham"
).load_data()

# Build a vector index over the loaded documents.
index = VectorStoreIndex.from_documents(documents=documents)

# The Cohere API key is read from the environment and passed explicitly.
api_key = os.environ["COHERE_API_KEY"]

# Register the OpenInference callback handler so retrieval events are traced.
callback_handler = OpenInferenceTraceCallbackHandler()
service_context = ServiceContext.from_defaults(
    callback_manager=CallbackManager(handlers=[callback_handler])
)

# Re-rank the retrieved nodes down to the top 2 with Cohere.
cohere_rerank = CohereRerank(api_key=api_key, top_n=2)
query_engine = index.as_query_engine(
    similarity_top_k=10,
    node_postprocessors=[cohere_rerank],
    service_context=service_context,
)
response = query_engine.query(
    "What did Sam Altman do in this essay?",
)
pprint_response(response)

# Inspect the spans collected by the registered handler.
print(callback_handler._tracer.span_buffer)

axiomofjoy (Contributor) commented:

LangChain implements Cohere re-ranking. Here's a script:

from langchain.llms import OpenAI
from langchain.retrievers import ContextualCompressionRetriever
from langchain.retrievers.document_compressors import CohereRerank
from langchain.chains import RetrievalQA

from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import OpenAIEmbeddings
from langchain.document_loaders import TextLoader
from langchain.vectorstores import Chroma
from phoenix.experimental.callbacks.langchain_tracer import OpenInferenceTracer


# Load the source text, split it into chunks, and build a Chroma retriever over them.
documents = TextLoader(
    "/Users/xandersong/langchain/docs/extras/modules/state_of_the_union.txt"
).load()
text_splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=100)
texts = text_splitter.split_documents(documents)
base_retriever = Chroma.from_documents(texts, OpenAIEmbeddings()).as_retriever(
    search_kwargs={"k": 20}
)
llm = OpenAI(temperature=0)
# Wrap the base retriever so Cohere re-ranks the 20 retrieved documents.
compressor = CohereRerank()
compression_retriever = ContextualCompressionRetriever(
    base_compressor=compressor, base_retriever=base_retriever
)

# Trace the chain with the OpenInference tracer.
tracer = OpenInferenceTracer()
chain = RetrievalQA.from_chain_type(llm=llm, retriever=compression_retriever)
query = "What did the president say about Ketanji Brown Jackson"
output = chain({"query": query}, callbacks=[tracer])
print(output)

axiomofjoy (Contributor) commented:

Instead of being implemented as a node post-processor, re-ranking happens by wrapping the base retriever (e.g., one using a simple cosine-similarity search) in a ContextualCompressionRetriever that has access to the Cohere re-ranker. Both the pre- and post-re-ranking documents are available in the callback system. As far as I can tell, the re-ranking relevance scores from the Cohere endpoint are not available.

axiomofjoy (Contributor) commented Aug 23, 2023

LangChain takes the opinionated stance here that re-rankers are just retrievers: there is a parent retriever span (the re-ranker span) with a child retriever span (the cosine-similarity span).
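A minimal sketch that makes the nesting visible, assuming a LangChain version that already has retriever callbacks (on_retriever_start / on_retriever_end); the handler name is mine. The outer ContextualCompressionRetriever run is the parent of the base retriever run, so the handler fires once with the pre-re-ranking documents (child run) and once with the post-re-ranking documents (parent run):

from typing import Any, Dict, Optional, Sequence
from uuid import UUID

from langchain.callbacks.base import BaseCallbackHandler
from langchain.schema import Document


class RetrieverRunLogger(BaseCallbackHandler):
    """Prints every retriever run and its parent run id to expose the nesting."""

    def on_retriever_start(
        self,
        serialized: Dict[str, Any],
        query: str,
        *,
        run_id: UUID,
        parent_run_id: Optional[UUID] = None,
        **kwargs: Any,
    ) -> Any:
        print(f"retriever start run_id={run_id} parent={parent_run_id} query={query!r}")

    def on_retriever_end(
        self,
        documents: Sequence[Document],
        *,
        run_id: UUID,
        parent_run_id: Optional[UUID] = None,
        **kwargs: Any,
    ) -> Any:
        print(f"retriever end run_id={run_id} parent={parent_run_id} num_docs={len(documents)}")


# Hypothetical usage with the script above:
# output = chain({"query": query}, callbacks=[tracer, RetrieverRunLogger()])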

mikeldking (Contributor, Author) commented:

Let's scope this one to llama_index and figure out whether we can surface information that is ultimately useful for calculating things like NDCG.
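For reference, a minimal NDCG@k sketch over graded relevance labels listed in ranked order; this is just the standard formula (function names are mine), independent of whatever span attributes we end up emitting:

import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    # relevances are graded relevance labels in the order the documents were ranked
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances: Sequence[float], k: int) -> float:
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


# e.g. the re-ranked order put the most relevant document third:
print(ndcg_at_k([0.0, 1.0, 2.0, 0.0], k=4))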
