
Add chat support to text generation pipeline #28945

Merged · 14 commits · Feb 16, 2024

Conversation


@Rocketknight1 Rocketknight1 commented Feb 9, 2024

This PR modifies the text generation pipeline to support chats. It does this by inspecting the inputs: if they look like strings, it uses the original causal LM pipeline; if they look like lists of message dicts, it applies a chat template before proceeding with generation.

Most changes are in the preprocessing/postprocessing - the actual generation itself is largely unchanged.
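The dispatch-on-input-type idea can be sketched roughly like this (a simplified illustration, not the actual pipeline code; `looks_like_chat` and `choose_path` are hypothetical helpers):

```python
# Simplified sketch of the input dispatch described above; not the real
# pipeline code. Strings go down the original causal-LM path, and lists
# of {"role": ..., "content": ...} dicts are treated as chats.

def looks_like_chat(text_input):
    """Hypothetical helper: True if the input is a list of message dicts."""
    return (
        isinstance(text_input, (list, tuple))
        and len(text_input) > 0
        and isinstance(text_input[0], dict)
        and "role" in text_input[0]
    )

def choose_path(text_input):
    if looks_like_chat(text_input):
        return "chat"       # would apply the tokenizer's chat template here
    return "causal_lm"      # original plain-string generation path

print(choose_path("Once upon a time"))                         # causal_lm
print(choose_path([{"role": "user", "content": "Hi there"}]))  # chat
```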

TODO:

  • Expand tests to cover other edge cases
  • Confirm the return format we want for this - just the model response, or the entire chat?
  • Add KV cache support, as this is important for performant multi-turn chat
  • Deprecate ConversationalPipeline and update the chat template docs to refer to this instead?

cc @ArthurZucker @gante @LysandreJik

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.


@julien-c julien-c left a comment


looks neat from a (very) superficial glance

I think this will be quite useful!


julien-c commented Feb 9, 2024

(and yes we should remove the old ConversationalPipeline sooner rather than later given it already doesn't work anymore due to conversational pipeline-type being removed from the Hub, IIUC)

@Rocketknight1 (Member, Author)

@julien-c Done! This PR now adds a DeprecationWarning to ConversationalPipeline. I also updated the chat template docs for the new pipeline.
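As a rough illustration of the deprecation pattern (the class name here is a stand-in; the real warning text and placement in `ConversationalPipeline` may differ):

```python
import warnings

# Stand-in class illustrating the DeprecationWarning pattern; the real
# ConversationalPipeline may word and place its warning differently.
class ConversationalPipelineSketch:
    def __init__(self, *args, **kwargs):
        warnings.warn(
            "ConversationalPipeline is deprecated; pass a list of message "
            "dicts to the text-generation pipeline instead.",
            DeprecationWarning,
        )
```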

@julien-c (Member)

very nice!


@gante gante left a comment


Nice! Thanks for adding this 🐈

src/transformers/pipelines/conversational.py (review thread, outdated, resolved)
src/transformers/pipelines/text_generation.py (review thread, outdated, resolved)
if isinstance(text_inputs[0], dict):
    return super().__call__(Chat(text_inputs), **kwargs)
else:
    chats = [Chat(chat) for chat in text_inputs]  # 🐈 🐈 🐈

best comment 😂
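For context, a hypothetical sketch of what a `Chat` wrapper like the one in the snippet above might do; the real class lives in `src/transformers/pipelines/text_generation.py` and may differ in detail:

```python
# Hypothetical Chat wrapper: holds a list of message dicts and checks
# that each message carries the expected "role" and "content" keys.
class Chat:
    def __init__(self, messages):
        for message in messages:
            if not ("role" in message and "content" in message):
                raise ValueError(
                    "Each message needs 'role' and 'content' keys."
                )
        self.messages = messages
```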

@Rocketknight1 (Member, Author)

One question for people, maybe @gante: Are you okay with the return format I'm using? Right now, if you pass a chat like this:

[ 
    {"role": "system", "content": "This is a system message."},
    {"role": "user", "content": "This is a test"},
]

You get a response that's the same chat, continued:

[
    {"role": "system", "content": "This is a system message."},
    {"role": "user", "content": "This is a test"},
    {"role": "assistant", "content": "This is a reply"},
]

I think this is the right thing to do, because it matches the behaviour of the existing text-generation pipeline (it returns the prompt at the start of the generated string). Let me know if you have a different opinion, though!
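In other words, the pipeline's output is the input chat plus one appended assistant turn. A toy illustration of that convention, with the model call stubbed out:

```python
chat = [
    {"role": "system", "content": "This is a system message."},
    {"role": "user", "content": "This is a test"},
]

def continue_chat(chat, reply):
    # Stub for the model call: the real pipeline generates `reply`.
    return chat + [{"role": "assistant", "content": reply}]

result = continue_chat(chat, "This is a reply")
assert result[:2] == chat                 # original turns are preserved
assert result[-1]["role"] == "assistant"  # the new turn is appended
```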


gante commented Feb 15, 2024

Looks good to me

@Rocketknight1 (Member, Author)

Cool!

@Rocketknight1 (Member, Author)

In that case, I think we're ready for final review (cc @amyeroberts) - I'm leaving the KV cache to another PR.

@Rocketknight1 (Member, Author)

cc @LysandreJik @julien-c as well if there's anything else you want me to add before we merge this!


@amyeroberts amyeroberts left a comment


Beautiful - thanks for adding this support!

src/transformers/tokenization_utils_base.py (review thread, outdated, resolved)
src/transformers/pipelines/text_generation.py (review thread, outdated, resolved)
@@ -216,7 +230,15 @@ def __call__(self, text_inputs, **kwargs):
             - **generated_token_ids** (`torch.Tensor` or `tf.Tensor`, present when `return_tensors=True`) -- The token
               ids of the generated text.
         """
-        return super().__call__(text_inputs, **kwargs)
+        if isinstance(text_inputs, (list, tuple)) and isinstance(text_inputs[0], (list, tuple, dict)):
Collaborator


Just to make sure - is it not possible for someone to pass this to the pipeline:

# Pass a list-of-list-of-strings
generator([["this is a dog"], ["this is a code example"], ["banana for scale"]])

Member Author


I tried that on main, and it just raises TypeError: can only concatenate str (not "list") to str. The existing pipeline only accepts a single string or a non-nested list/tuple of strings, so this check doesn't misroute any currently valid input.
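A quick sketch of why the check is safe: the first element of any valid batched input is a string, while chat inputs start with a dict (or a list of dicts for batched chats). Assuming the check quoted in the diff above:

```python
# The check from the diff: route to the chat path only when the first
# element is itself a list/tuple/dict rather than a plain string.
def is_chat_input(text_inputs):
    return isinstance(text_inputs, (list, tuple)) and isinstance(
        text_inputs[0], (list, tuple, dict)
    )

assert not is_chat_input(["this is a dog", "banana for scale"])  # flat string batch
assert is_chat_input([{"role": "user", "content": "hi"}])        # single chat
assert is_chat_input([[{"role": "user", "content": "hi"}]])      # batch of chats
```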

Rocketknight1 and others added 2 commits February 16, 2024 13:48
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
@Rocketknight1 Rocketknight1 merged commit 2f1003b into main Feb 16, 2024
22 checks passed
@Rocketknight1 Rocketknight1 deleted the support_chat_in_text_gen_pipeline branch February 16, 2024 16:41
zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request Feb 19, 2024
* Add chat support to text generation pipeline

* Better handling of single elements

* Deprecate ConversationalPipeline

* stash commit

* Add missing add_special_tokens kwarg

* Update chat templating docs to refer to TextGenerationPipeline instead of ConversationalPipeline

* Add ✨TF✨ tests

* @require_tf

* Add type hint

* Add specific deprecation version

* Remove unnecessary do_sample

* Remove todo - the discrepancy has been resolved

* Update src/transformers/tokenization_utils_base.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/pipelines/text_generation.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
itazap pushed a commit that referenced this pull request May 14, 2024