Checklist
Motivation
Only apply constrained sampling only in the answer for reasoning model. i.e. for DeepSeek R1 only enforce grammar inside after </think>
This would make Reasoning models more useful in agent workflow expecting structured output.
Related resources
vllm-project/vllm#12619
vllm-project/vllm#12955