[Bug]: Remove fallback to outlines for int/number range and pattern constraints in guided_json

### Your current environment

- vllm 0.8.3
- xgrammar

### 🐛 Describe the bug

Hi team,

I’d like to propose removing the current fallback mechanism to outlines for int/number range constraints and pattern-based validations in Guided JSON mode.

https://github.com/vllm-project/vllm/blob/ee378f3d49f1404a72ec0948f0a2553f7c3a3726/vllm/model_executor/guided_decoding/utils.py#L6-L23

Previously, vLLM defaulted to Outlines when encountering complex JSON schema elements (like pattern, minimum, or maximum) due to limitations in xgrammar. https://github.com/vllm-project/vllm/pull/10899


However, xgrammar has now added full support for these schema constraints, which makes this fallback no longer necessary.

- https://github.com/mlc-ai/xgrammar/pull/185
- https://github.com/mlc-ai/xgrammar/pull/289
- https://github.com/mlc-ai/xgrammar/blob/8fa47978e37970865a6630a9533f2e1db7dc8f46/cpp/json_schema_converter.cc#L1645-L1655

The main motivation behind this PR is not only to simplify and unify the backend behavior, but also to address a serious performance concern. When fallback to Outlines occurs in production, we've experienced:

- **Significantly higher memory usage**, leading to OOM errors, and
- **Much slower response times**, which in some cases rendered Guided JSON unusable for real-time services.

Additionally, this proposal is strongly inspired by the recent PR [[Bugfix][v1] xgrammar structured output supports Enum](https://github.com/vllm-project/vllm/pull/15594), which brought long-awaited support for enum in xgrammar. Just as that change allowed us to move away from unnecessary fallbacks for enum types, we believe it's now equally reasonable and timely to eliminate fallbacks related to numeric ranges and regex patterns as well.

This change would allow Guided JSON to be more reliable and performant under real-world workloads, especially when schemas include numeric ranges or regex patterns.

Looking forward to your feedback!

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

	def has_xgrammar_unsupported_json_features(schema: dict) -> bool:
	"""Check if JSON schema contains features unsupported by xgrammar."""

	def check_object(obj: dict) -> bool:
	if not isinstance(obj, dict):
	return False

	# Check for pattern restrictions
	if "pattern" in obj:
	return True

	# Check for numeric ranges
	if obj.get("type") in ("integer", "number") and any(
	key in obj for key in [
	"minimum", "maximum", "exclusiveMinimum",
	"exclusiveMaximum", "multipleOf"
	]):
	return True

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bug]: Remove fallback to outlines for int/number range and pattern constraints in guided_json #16723

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Bug]: Remove fallback to outlines for int/number range and pattern constraints in guided_json #16723

Description

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions