fix(truss): make the passing in the chat template more robust BT-15187 #1881

aghilann · 2025-08-21T21:13:04Z

🚀 What

If chat_template.jinja exists at the top level of the checkpoint (which it usually should), pass it to vLLM explicitly. If it does not exist there, the template should be defined inside tokenizer_config.json. vLLM does not pick up a standalone Jinja template unless it is passed in, and due to a change in Transformers, the chat_template key is no longer written to tokenizer_config.json.

PR where HF removed the chat_template field in the tokenizer
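
A minimal sketch of the selection logic described above, not the code from this PR. The helper name `chat_template_args` and the `checkpoint_dir` parameter are hypothetical. It assumes the checkpoint is a local directory and that the returned arguments are appended to a `vllm serve` command, using vLLM's `--chat-template` flag.

```python
from pathlib import Path


def chat_template_args(checkpoint_dir: Path) -> list[str]:
    """Return extra `vllm serve` arguments for the chat template.

    Hypothetical helper for illustration only.
    """
    template_file = checkpoint_dir / "chat_template.jinja"
    if template_file.is_file():
        # Newer Transformers checkpoints save the template as a standalone
        # file, which vLLM only uses when it is passed in explicitly.
        return ["--chat-template", str(template_file)]
    # Otherwise the template is expected to live inside
    # tokenizer_config.json, so nothing extra is passed.
    return []
```

With a checkpoint directory that contains chat_template.jinja, the helper returns the flag followed by the file path. Without the file it returns an empty list, and the serve command is left unchanged.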

💻 How

🔬 Testing

Ran the code to ensure the model starts correctly and that I can chat with it, both with chat_template.jinja at the top level and with the template defined inside tokenizer_config.json, covering both branches of the if statement. I actually talked to the model through the chat endpoint rather than just waiting for it to come up.

linear · 2025-08-21T21:29:34Z

BT-15187 [truss] Implement robustness around the chat template

rcano-baseten

if we need to extend this at all in the future, we should probably send information back from the server about the checkpoint

rcano-baseten · 2025-08-21T22:36:27Z

truss/cli/train/deploy_checkpoints/deploy_full_checkpoints.py

    setup_environment_variables_and_secrets,
 )

+# NB(aghilan): Transformers was recently changed to save a chat_template.jinja file instead of inside the tokenizer_config.json file.


what version? might be helpful to know when it's safe to remove this logic/change this logic?

Aghilan Nathan added 3 commits August 20, 2025 12:10

fix(truss): explicitly pass in the chat_template flag and file

67c5914

fix(truss): move deployments more robust

a4527e2

fix(truss): unit test for vllm serve command

df83df0

aghilann changed the title ~~fix(truss): make the passing in the chat template more robust~~ fix(truss): make the passing in the chat template more robust BT-15187 Aug 21, 2025

aghilann requested review from bdubayah and rcano-baseten August 21, 2025 21:30

rcano-baseten approved these changes Aug 21, 2025

fix(truss): add comment

562433b

aghilann enabled auto-merge (squash) August 21, 2025 23:20

aghilann merged commit eecf341 into main Aug 21, 2025
18 checks passed

aghilann deleted the chat-template-fix branch August 21, 2025 23:23
