Enable/disable training on thinking tokens by configuration #117

Draft
mdawn65 wants to merge 1 commit into NovaSky-AI:main from mdawn65:maggie/enable-training-on-thinking-tokens
mdawn65 commented Jul 26, 2025

Summary

This PR adds a configuration option that enables or disables training on "thinking tokens" for Qwen3 models in the generator. It works toward issue #104.

Changes

  • Modified get_custom_chat_template to accept a train_on_thinking_tokens flag.
  • Passed the train_on_thinking_tokens config from generator_cfg to get_custom_chat_template.
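
The two changes above could look roughly like the sketch below. It is only an illustration under stated assumptions: the PR description does not show the real signature of get_custom_chat_template, the structure of generator_cfg, or the template contents, so the GeneratorConfig class, the model_name parameter, the default value of the flag, and both placeholder template strings are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

# Placeholder template bodies (hypothetical). The real chat templates are
# not shown in the PR description, so these strings only mark which
# variant was selected.
QWEN3_TEMPLATE_WITH_THINKING = "<placeholder: template that keeps thinking tokens>"
QWEN3_TEMPLATE_WITHOUT_THINKING = "<placeholder: template that drops thinking tokens>"


@dataclass
class GeneratorConfig:
    """Hypothetical stand-in for generator_cfg."""

    # The default value here is an assumption, not taken from the PR.
    train_on_thinking_tokens: bool = True


def get_custom_chat_template(
    model_name: str, train_on_thinking_tokens: bool = True
) -> Optional[str]:
    """Pick a chat template variant based on the flag.

    Assumed behavior: only Qwen3 models get a custom template; every
    other model returns None and falls back to the tokenizer default.
    """
    if "qwen3" not in model_name.lower():
        return None
    if train_on_thinking_tokens:
        return QWEN3_TEMPLATE_WITH_THINKING
    return QWEN3_TEMPLATE_WITHOUT_THINKING


# The generator reads the flag from its config and forwards it.
generator_cfg = GeneratorConfig(train_on_thinking_tokens=False)
template = get_custom_chat_template(
    "Qwen/Qwen3-8B", generator_cfg.train_on_thinking_tokens
)
```

Keeping the flag as an explicit keyword argument, rather than reading the config inside the helper, means the template selection can be tested without constructing a full generator.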
