
No chat template in evaluation #2459


Open
xueyan-lii opened this issue Mar 5, 2025 · 5 comments
Labels: community help wanted, enhancement

Comments

@xueyan-lii

In the Llama 3 tokenizer, the chat template is applied with the option to add a system instruction. However, when using eleuther_eval.py, the tokenization process does not apply the chat template. How can this be done?
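For context, here is roughly what the chat template adds when using a plain Hugging Face tokenizer. This is only to illustrate the gap (the model id below is just an example; the torchtune tokenizer has its own templating path):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")  # example checkpoint

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the capital of France?"},
]

# With the chat template: header/special tokens and the system turn are inserted.
templated = tok.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

# Without it (what the eval path effectively does today): the raw question is
# tokenized as plain text, so an instruct-tuned model never sees its expected format.
plain = tok("What is the capital of France?")["input_ids"]
```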

@felipemello1
Contributor

felipemello1 commented Mar 5, 2025

@joecummings, mind taking a look?

felipemello1 added the bug label Mar 10, 2025
@krammnic
Contributor

@felipemello1 It should be a simple fix. We need to expose the apply_chat_template argument in our wrapper (note that it is not a bool!). According to lm_eval: chat_template=getattr(lm, "apply_chat_template").
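Roughly how the flag is threaded through on the lm_eval side (a sketch based on the getattr line above; exact internals vary by lm_eval version, and `eval_wrapper` is assumed to be the recipe's wrapper instance):

```python
# apply_chat_template is Union[bool, str]: a string can select a named template,
# which is why it is "not a bool".
from lm_eval import simple_evaluate

eval_wrapper = ...  # the recipe's _LLMEvalWrapper instance (assumed to exist here)

results = simple_evaluate(
    model=eval_wrapper,
    tasks=["truthfulqa_mc2"],
    apply_chat_template=True,   # or a string naming a specific template
)

# Inside the harness this roughly becomes:
#   chat_template = getattr(lm, "apply_chat_template")
# so the wrapper must expose that method and return a formatted prompt string.
```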

pbontrager linked a pull request Mar 11, 2025 that will close this issue
pbontrager removed a link to a pull request Mar 12, 2025
pbontrager added the enhancement label and removed the bug label Mar 12, 2025
@pbontrager
Contributor

I explored using torchtune's apply_chat_template here, but since Eleuther does its own formatting, they'd conflict with each other. To echo what @krammnic already said, we need to:

  • Override HFLM's "apply_chat_template" method in _LLMEvalWrapper, shown here. This should stay mostly the same, but it should grab the chat_template function that's inside the torchtune tokenizer here (see the sketch after this list).
  • Verify that apply_chat_template from the HF tokenizer does the same thing as torchtune's apply template.
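Something along these lines could be a starting point. It's only a sketch: it assumes the wrapper keeps the torchtune tokenizer on self._tokenizer and that lm_eval calls apply_chat_template(chat_history) on the LM object (the exact signature differs between lm_eval versions).

```python
from lm_eval.models.huggingface import HFLM
from torchtune.data import Message


class _LLMEvalWrapper(HFLM):
    """Sketch: route lm_eval's chat-template path through the torchtune tokenizer."""

    def apply_chat_template(self, chat_history, add_generation_prompt: bool = True) -> str:
        # lm_eval passes a list of {"role": ..., "content": ...} dicts and expects
        # the fully formatted prompt string back.
        messages = [Message(role=m["role"], content=m["content"]) for m in chat_history]
        if add_generation_prompt:
            # Open an empty assistant turn so the model continues from the header.
            messages.append(Message(role="assistant", content=""))
        # `self._tokenizer` is assumed to be the torchtune model tokenizer held by the
        # wrapper; tokenize_messages applies the model's special tokens/template, and
        # decoding the ids back to text yields the templated prompt. Whether end-of-turn
        # tokens should be added here depends on the tokenizer version, so treat this as
        # a draft, not the final design.
        tokens, _ = self._tokenizer.tokenize_messages(messages)
        return self._tokenizer.decode(tokens)
```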

pbontrager added the community help wanted label Mar 12, 2025
@krammnic
Contributor

krammnic commented Apr 1, 2025

@pbontrager Can I take this now?

@joecummings
Contributor

> @pbontrager Can I take this now?

fire the lasers

krammnic mentioned this issue Apr 9, 2025