Enable configuring custom chat template #104

@tyler-griggs

Description

Currently, the SkyRL Gym Generator masks out thinking tokens for Qwen3 models using the get_custom_chat_template hook. However, some users want to train Qwen3 models and keep the thinking tokens.

Users want the ability to choose whether to train on thinking tokens and, more generally, want to be able to provide a custom chat template without forking the code.

TODOs

  • Provide an easier configuration option or hook for supplying a custom chat template to the SkyRL Gym Generator.
  • Add a configuration option to either train on thinking tokens or mask them out (by selecting between different custom templates).
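
One possible shape for the two TODOs above, as a minimal sketch rather than a proposed implementation. Everything here is hypothetical: the `ChatTemplateConfig` fields, the `resolve_chat_template` helper, the template names, and the placeholder Jinja bodies are assumptions for illustration and do not exist in SkyRL today.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Optional

# Hypothetical registry of built-in templates, keyed by name.
# The Jinja bodies are placeholders, not the real Qwen3 templates.
BUILTIN_TEMPLATES = {
    "qwen3_without_thinking": "{# placeholder: drops thinking content #}",
    "qwen3_with_thinking": "{# placeholder: keeps thinking content #}",
}


@dataclass
class ChatTemplateConfig:
    # Hypothetical config fields; the names are illustrative only.
    name: Optional[str] = None  # key into BUILTIN_TEMPLATES
    path: Optional[str] = None  # path to a user-supplied Jinja file


def resolve_chat_template(cfg: ChatTemplateConfig) -> Optional[str]:
    """Return a Jinja template string, or None to use the tokenizer default."""
    if cfg.name is not None and cfg.path is not None:
        raise ValueError("Set only one of 'name' or 'path'.")
    if cfg.path is not None:
        # User-supplied template: no code changes or fork required.
        return Path(cfg.path).read_text(encoding="utf-8")
    if cfg.name is not None:
        if cfg.name not in BUILTIN_TEMPLATES:
            raise ValueError(
                f"Unknown chat template {cfg.name!r}; "
                f"choose from {sorted(BUILTIN_TEMPLATES)}"
            )
        return BUILTIN_TEMPLATES[cfg.name]
    return None
```

With something like this, choosing whether to train on thinking tokens would be a matter of setting `name` to one of the built-in entries, while `path` would cover arbitrary user templates. The resolved string could then be handed to the tokenizer, for example through the `chat_template` argument of Hugging Face's `apply_chat_template`.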
