[Feature] Apply structured output sampling after reasoning steps in Reasoning models

### Checklist

- [ ] 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose instead. Otherwise, it will be closed.
- [ ] 2. Please use English, otherwise it will be closed.

### Motivation

Apply constrained sampling only to the final answer of a reasoning model, not to its reasoning steps. For example, for DeepSeek R1, enforce the grammar only on tokens generated after `</think>`.
This would make reasoning models more useful in agent workflows that expect structured output.
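
To make the request concrete, here is a minimal sketch of the gating behaviour in plain Python. It is not SGLang's API: the class `ReasoningGatedConstraint`, the callback `allowed_fn`, the token ids, and the toy grammar are all hypothetical names invented for illustration. The idea is that the sampler leaves the vocabulary unrestricted until the end-of-reasoning token is seen, then starts feeding only the answer tokens to the grammar.

```python
from typing import Callable, List, Set


class ReasoningGatedConstraint:
    """Hypothetical wrapper: apply a grammar only after the end-of-reasoning token."""

    def __init__(self, think_end_id: int,
                 allowed_fn: Callable[[List[int]], Set[int]]):
        self.think_end_id = think_end_id  # assumed id of the `</think>` token
        self.allowed_fn = allowed_fn      # grammar: answer prefix -> allowed ids
        self.in_answer = False
        self.answer_tokens: List[int] = []

    def allowed_tokens(self, vocab: Set[int]) -> Set[int]:
        # While the model is still reasoning, nothing is masked.
        if not self.in_answer:
            return set(vocab)
        # In the answer, restrict sampling to what the grammar permits.
        return self.allowed_fn(self.answer_tokens) & vocab

    def accept(self, token_id: int) -> None:
        if self.in_answer:
            # Only answer tokens advance the grammar state.
            self.answer_tokens.append(token_id)
        elif token_id == self.think_end_id:
            # The grammar starts from an empty prefix right after `</think>`.
            self.in_answer = True


# Toy setup (all ids are made up): the answer must be token 1, then 2, then EOS.
VOCAB = set(range(6))
THINK_END, EOS = 5, 0


def toy_grammar(prefix: List[int]) -> Set[int]:
    expected = [1, 2]
    if len(prefix) < len(expected):
        return {expected[len(prefix)]}
    return {EOS}


gate = ReasoningGatedConstraint(THINK_END, toy_grammar)
free_before = gate.allowed_tokens(VOCAB)   # unrestricted during reasoning
gate.accept(3)                             # an arbitrary reasoning token
gate.accept(THINK_END)                     # reasoning ends here
first_answer = gate.allowed_tokens(VOCAB)  # grammar now applies
gate.accept(1)
second_answer = gate.allowed_tokens(VOCAB)
```

In a real engine the same switch would presumably sit in front of the grammar backend's token mask, with the `</think>` token id taken from the model's tokenizer rather than hard-coded.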

### Related resources

https://github.com/vllm-project/vllm/issues/12619
https://github.com/vllm-project/vllm/pull/12955

[Feature] Apply structured output sampling after reasoning steps in Reasoning models #4055
