Added pipeline args to `HuggingFacePipeline.from_model_id` #5268

solomspd · 2023-05-25T20:56:15Z

Currently, `HuggingFacePipeline.from_model_id` does not allow passing arguments through to the underlying transformers pipeline.
This PR adds a `pipeline_kwargs` parameter so that pipeline parameters such as `max_new_tokens` can be set.
Before this PR, you had to create the pipeline manually with Hugging Face transformers and then hand it to LangChain.

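For example, instead of this:

```py
from langchain.llms import HuggingFacePipeline
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

# Build the transformers pipeline by hand so max_new_tokens can be set.
model_id = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
pipe = pipeline(
    "text-generation", model=model, tokenizer=tokenizer, max_new_tokens=10
)
hf = HuggingFacePipeline(pipeline=pipe)
```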

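you can write this:

```py
from langchain.llms import HuggingFacePipeline

# pipeline_kwargs is forwarded to the transformers pipeline.
hf = HuggingFacePipeline.from_model_id(
    model_id="gpt2", task="text-generation", pipeline_kwargs={"max_new_tokens": 10}
)
```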
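
To illustrate the forwarding idea described above, here is a minimal sketch, not the actual diff in this PR. It assumes the new argument is an optional dict that gets unpacked into the pipeline constructor call. `fake_pipeline` and `from_model_id_sketch` are hypothetical names used only for illustration; `fake_pipeline` stands in for `transformers.pipeline` so the snippet runs without downloading a model.

```python
from typing import Any, Callable, Dict, Optional


def fake_pipeline(task: str, model: str, **kwargs: Any) -> Dict[str, Any]:
    """Hypothetical stand-in for transformers.pipeline; records what it received."""
    return {"task": task, "model": model, **kwargs}


def from_model_id_sketch(
    model_id: str,
    task: str,
    pipeline_kwargs: Optional[Dict[str, Any]] = None,
    pipeline_factory: Callable[..., Dict[str, Any]] = fake_pipeline,
) -> Dict[str, Any]:
    """Hypothetical helper showing how extra pipeline options could be forwarded."""
    # Treat a missing dict as "no extra arguments" so callers can omit it.
    extra = pipeline_kwargs or {}
    # Unpack the caller's options into the pipeline constructor call.
    return pipeline_factory(task=task, model=model_id, **extra)


# With extra options: max_new_tokens reaches the pipeline factory.
configured = from_model_id_sketch(
    "gpt2", "text-generation", pipeline_kwargs={"max_new_tokens": 10}
)

# Without extra options: the call behaves as it would without the new argument.
default = from_model_id_sketch("gpt2", "text-generation")
```

Defaulting to `None` rather than `{}` in the signature avoids sharing one mutable dict across calls, and it keeps existing callers working unchanged.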

Who can review?

@hwchase17
@agola11

…ai#5268)

Co-authored-by: Dev 2049 <dev.dev2049@gmail.com>

solomspd and others added 4 commits May 25, 2023 23:02

added pipline args to HuggingFacePipeline

9a15c55

added doc strings

0f9b3c1

cr

f57058f

cr

26ae62d

dev2049 added the lgtm label May 25, 2023

dev2049 merged commit 2ef5579 into langchain-ai:master May 26, 2023

danielchalef mentioned this pull request Jun 5, 2023

Zep Hybrid Search #5742

Merged

This was referenced Jun 25, 2023

Zep Authentication #6725

Closed

Zep Authentication #6728

Merged
