Support for LCEL Runnables #1586

Merged (10 commits, Oct 29, 2023)

Conversation

joshuasundance-swca (Contributor)

Summary

This PR addresses #1564 by enhancing the LangChain representation class to support more flexible LangChain pipelines, including LCEL Runnables. I am certainly open to more changes as needed before merging.

Changes

  • Generalize chain parameter to take any Runnable object instead of just QA chains
  • Add chain_config parameter to support RunnableConfig
  • Use .batch() instead of .run() to invoke chains
  • Fix return type in extract_topics
  • Update docstring for LangChain.__init__
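The switch from `.run()` to `.batch()` means one call can generate representations for a list of inputs instead of invoking the chain once per topic. A minimal sketch of the idea, using a toy stand-in for a Runnable (not the actual LangChain API; `ToyRunnable` is hypothetical):

```python
# Toy stand-in for a LangChain Runnable, illustrating why .batch()
# replaces per-topic .run() calls: one call handles a list of inputs.
class ToyRunnable:
    def __init__(self, fn):
        self.fn = fn

    def invoke(self, inp, config=None):
        # Process a single input (analogous to the old .run() path).
        return self.fn(inp)

    def batch(self, inputs, config=None):
        # Process many inputs in one call; a real Runnable may also
        # parallelize these according to the supplied RunnableConfig.
        return [self.invoke(inp, config=config) for inp in inputs]


chain = ToyRunnable(lambda d: d["question"].upper())

# One call per topic (old style):
single = chain.invoke({"question": "what is this topic about?"})

# One call for all topics (new style):
many = chain.batch([
    {"question": "what is this topic about?"},
    {"question": "summarize these documents"},
])
print(single)  # WHAT IS THIS TOPIC ABOUT?
print(many)
```

The `config` argument mirrors how a `RunnableConfig` would flow through a real chain, which is what the new `chain_config` parameter passes along.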

Example Usage

Here is an example of using a custom LCEL pipeline with the updated LangChain class (from the updated docstring):

from bertopic.representation import LangChain
from langchain.chains.question_answering import load_qa_chain
from langchain.chat_models import ChatAnthropic
from langchain.schema.document import Document
from langchain.schema.runnable import RunnablePassthrough
from langchain_experimental.data_anonymizer.presidio import PresidioReversibleAnonymizer

prompt = ...
llm = ...

# We will construct a special privacy-preserving chain using Microsoft Presidio

pii_handler = PresidioReversibleAnonymizer(analyzed_fields=["PERSON"])

chain = (
        {
            "input_documents": (
                lambda inp: [
                    Document(
                        page_content=pii_handler.anonymize(
                            d.page_content,
                            language="en",
                        ),
                    )
                    for d in inp["input_documents"]
                ]
            ),
            "question": RunnablePassthrough(),
        }
        | load_qa_chain(llm, chain_type="stuff")
        | (lambda output: {"output_text": pii_handler.deanonymize(output["output_text"])})
)

representation_model = LangChain(chain, prompt=prompt)

@MaartenGr (Owner) left a comment:

Incredible PR! Thanks to the extensive documentation, description, and changes, this was a pleasure to collaborate on. I only have two very minor suggestions; other than that it looks great.

Review threads on bertopic/representation/_langchain.py (resolved)
@MaartenGr (Owner):

Awesome, thanks for the great collaboration and the extensive work! I highly appreciate such thorough PRs/Issues and I think users will love to have this feature. Also, the example you gave is just great!

@MaartenGr MaartenGr merged commit b57a8db into MaartenGr:master Oct 29, 2023
2 checks passed
@joshuasundance-swca joshuasundance-swca deleted the batch branch October 30, 2023 14:06