Allow apply_chat_template to pass kwargs to the template and support a dict of templates #29658

Rocketknight1 · 2024-03-14T14:29:28Z

As the title suggests, this is a simple PR that allows two new features:

kwargs can be passed through apply_chat_template to the template renderer.
Models can have a dict of multiple chat templates, which can be accessed by passing their name to apply_chat_template (this is used by the new Command-R model)

This PR is very slightly breaking (we used to pass kwargs to the tokenizer), but I don't think this was commonly used, and I think the new usage is more intuitive. Users can still pass kwargs through the method to the tokenizer with the tokenizer_kwargs argument.

Other than that, it should have no effect on existing users/templates!

HuggingFaceDocBuilderDev · 2024-03-14T15:01:06Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…n self.get_tokenizers()

LysandreJik · 2024-03-14T15:39:32Z

src/transformers/tokenization_utils_base.py

+        tokenizer_kwargs: Optional[Dict[str, Any]] = None,
+        **kwargs,


Is this a breaking change? 👀

Previously passed "tokenizer kwargs" will now be passed to "kwargs"

LysandreJik · 2024-03-14T15:45:28Z

src/transformers/tokenization_utils_base.py

+            template_dict = self.chat_template or self.default_chat_template
+            if chat_template is not None and chat_template in template_dict:
+                # The user can pass the name of a template to the chat template argument instead of an entire template
+                chat_template = template_dict[chat_template]


At this point, isn't chat_template the actual chat template that's passed as input?

chat_template (str, *optional*): A Jinja template to use for this conversion.

It's being used as the key in the template_dict, but what if the user had passed a chat_template to be used as a template?

It seems like if self.chat_template is a dict, when passing in chat_template it is always considered a key when it could be a template. If self.chat_template isn't a dict and self.default_chat_template isn't either, then chat_template is now considered a chat template

It's a bit hacky, but chat_template is only used as a key if it exists as a key in the template_dict. If the user passes an actual Jinja template, that almost certainly will not exist as a key, and so chat_template is treated as a template string.

LysandreJik

Awesome, this looks good to me! Thanks @Rocketknight1

This'll end up changing some templates on the Hub so let's let others that depend on it know.

…a dict of templates (#29658) * Allow apply_chat_template to pass kwargs to the template * Fix priority for template_kwargs * Fix docstring * style fix * Add the option for the model to have a dict of templates * Error message cleanup * Add test for chat template dicts * Simplify the chat template dict test and apply it to all tokenizers in self.get_tokenizers() * Save chat template dicts as lists with fixed key names * Add test for serialization/reloading * Add require_jinja just to be safe, even though I don't think we use it

Rocketknight1 added 4 commits March 14, 2024 14:28

Allow apply_chat_template to pass kwargs to the template

1e908c7

Fix priority for template_kwargs

024f786

Fix docstring

0222077

style fix

2cb238c

Add the option for the model to have a dict of templates

1df5db7

Rocketknight1 changed the title ~~Allow apply_chat_template to pass kwargs to the template~~ Allow apply_chat_template to pass kwargs to the template and support a dict of templates Mar 14, 2024

Rocketknight1 added 3 commits March 14, 2024 15:07

Error message cleanup

6a39d6c

Add test for chat template dicts

8d9789c

Simplify the chat template dict test and apply it to all tokenizers i…

53ca0fb

…n self.get_tokenizers()

LysandreJik reviewed Mar 14, 2024

View reviewed changes

LysandreJik approved these changes Mar 14, 2024

View reviewed changes

Rocketknight1 added 3 commits March 14, 2024 17:50

Save chat template dicts as lists with fixed key names

6b71e6e

Add test for serialization/reloading

e814006

Add require_jinja just to be safe, even though I don't think we use it

97442a6

Rocketknight1 merged commit 48fbab7 into main Mar 14, 2024
21 checks passed

Rocketknight1 deleted the allow_chat_template_kwargs branch March 14, 2024 18:23

Rocketknight1 mentioned this pull request Mar 14, 2024

Cohere Model Release #29622

Merged

4 tasks

CISC mentioned this pull request Apr 8, 2024

Models with multiple chat templates abetlen/llama-cpp-python#1336

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow apply_chat_template to pass kwargs to the template and support a dict of templates #29658

Allow apply_chat_template to pass kwargs to the template and support a dict of templates #29658

Rocketknight1 commented Mar 14, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Mar 14, 2024

LysandreJik Mar 14, 2024

LysandreJik Mar 14, 2024

LysandreJik Mar 14, 2024

Rocketknight1 Mar 14, 2024

LysandreJik left a comment

Allow apply_chat_template to pass kwargs to the template and support a dict of templates #29658

Allow apply_chat_template to pass kwargs to the template and support a dict of templates #29658

Conversation

Rocketknight1 commented Mar 14, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Mar 14, 2024

LysandreJik Mar 14, 2024

Choose a reason for hiding this comment

LysandreJik Mar 14, 2024

Choose a reason for hiding this comment

LysandreJik Mar 14, 2024

Choose a reason for hiding this comment

Rocketknight1 Mar 14, 2024

Choose a reason for hiding this comment

LysandreJik left a comment

Choose a reason for hiding this comment

Rocketknight1 commented Mar 14, 2024 •

edited

Loading