chore: deprecate duplicate params in nvext #2754

ryan-lempka · 2025-08-28T00:04:44Z

Overview:

Deprecate parameters that are available both under nvext and at the top level of the request. Going forward, engine-level args should be placed at the top level.

This removes duplication within the API and aligns with the conventions most common today in API schemas associated with backends like vLLM and SGLang.

Details:

Parameters deprecated
nvext.ignore_eos -> Use ignore_eos
nvext.guided_json -> Use guided_json
nvext.guided_regex -> Use guided_regex
nvext.guided_grammar -> Use guided_grammar
nvext.guided_choice -> Use guided_choice
nvext.guided_decoding_backend -> Use guided_decoding_backend
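
The mapping above is mechanical: each deprecated field keeps its name and simply moves out of `nvext`. As a rough illustration only, the lookup could be sketched like this; `top_level_replacement` and `DEPRECATED_NVEXT_FIELDS` are hypothetical names and are not part of this PR.

```rust
/// Fields this PR deprecates under `nvext`. Each keeps the same name at the top level.
const DEPRECATED_NVEXT_FIELDS: [&str; 6] = [
    "ignore_eos",
    "guided_json",
    "guided_regex",
    "guided_grammar",
    "guided_choice",
    "guided_decoding_backend",
];

/// Hypothetical helper (not in the PR): given a path such as
/// `nvext.guided_regex`, return the top-level name to use instead.
/// Returns `None` for paths this PR does not deprecate.
pub fn top_level_replacement(path: &str) -> Option<&'static str> {
    let field = path.strip_prefix("nvext.")?;
    DEPRECATED_NVEXT_FIELDS
        .iter()
        .copied()
        .find(|name| *name == field)
}
```

Fields such as `nvext.top_k` return `None` here because they are left to the follow-on change listed under Next Steps.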

Changes:

Added choose_with_deprecation() helper for consistent deprecation handling
Server logs warnings when deprecated nvext parameters are used
Top-level parameters take precedence (existing behavior)
Backward compatible – both versions still work
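
To make the behavior in this list concrete, here is a minimal, self-contained sketch of the helper pair. It follows the shape quoted in the review below (`common.cloned().or_else(|| nv.cloned())`), but it is an approximation: `eprintln!` stands in for the `tracing::warn!` call used in the PR, the parameter names are assumed, and the message wording is paraphrased.

```rust
/// Logs a deprecation notice when a value was supplied under `nvext`.
/// Assumption: the real helper logs through `tracing::warn!`; `eprintln!`
/// is used here so the sketch needs no external crates.
fn emit_nvext_deprecation_warning(field_name: &str, nvext_set: bool, top_level_set: bool) {
    if !nvext_set {
        return;
    }
    if top_level_set {
        // Both locations are set: the nested value is ignored.
        eprintln!(
            "DEPRECATION WARNING: 'nvext.{field_name}' is deprecated. \
             Top-level '{field_name}' takes precedence."
        );
    } else {
        // Only the nested value is set: it is still honored for now.
        eprintln!(
            "DEPRECATION WARNING: 'nvext.{field_name}' is deprecated. \
             Use '{field_name}' at the top level instead."
        );
    }
}

/// Returns the top-level value if present, otherwise the `nvext` value.
/// Warns whenever an `nvext` value was supplied, even if it is not used.
pub fn choose_with_deprecation<T: Clone>(
    field: &'static str,
    common: Option<&T>,
    nv: Option<&T>,
) -> Option<T> {
    if nv.is_some() {
        emit_nvext_deprecation_warning(field, true, common.is_some());
    }
    common.cloned().or_else(|| nv.cloned())
}
```

Because the fallback still returns the `nvext` value, existing clients keep working and only see a warning in the server log.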

Next Steps:

Follow-on MR: Add top_k and repetition_penalty to top-level and deprecate in nvext
0.6.0: Remove deprecated nvext parameters entirely

Where should the reviewer start?

Review the helper functions added to emit deprecation warnings, and check that every parameter slated for deprecation is covered.

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: #xxx

Summary by CodeRabbit

New Features
- Deprecation-aware handling for guidance options and ignore_eos across Completions and Chat Completions.
- Clear precedence: top-level settings now take priority over nested overrides.
- User-facing warnings when deprecated nested fields are used.
- Unified logic for selecting effective values, including stop-condition behavior.
Documentation
- Clarified precedence rules and deprecation behavior for guidance and stop-condition settings.
Tests
- Added unit tests to validate precedence and deprecation handling.

coderabbitai · 2025-08-28T00:07:57Z

Walkthrough

Introduces deprecation-aware helpers for resolving overlapping CommonExt vs NvExt fields and applies them to guided options and ignore_eos across OpenAI completions and chat completions. Adds get_ignore_eos accessors, centralizes precedence (CommonExt over NvExt), and emits deprecation warnings when NvExt values are considered. Updates default ignore_eos resolution in protocols.

Changes

| Cohort / File(s) | Summary |
|---|---|
| Deprecation helpers (CommonExt/NvExt resolution) `lib/llm/src/protocols/openai/common_ext.rs` | Adds `emit_nvext_deprecation_warning` and `choose_with_deprecation<T>`; implements tests for precedence and fallback behavior. |
| Completions: guided options and stop conditions `lib/llm/src/protocols/openai/completions.rs` | Refactors guided option getters to use `choose_with_deprecation`; `get_guided_json` now warns on NvExt usage; adds `get_ignore_eos` via `OpenAIStopConditionsProvider` using precedence (CommonExt over NvExt). |
| Chat Completions: guided options and stop conditions `lib/llm/src/protocols/openai/chat_completions.rs` | Applies `choose_with_deprecation` to guided option getters; emits deprecation warnings; adds `get_ignore_eos` in `OpenAIStopConditionsProvider` with CommonExt-first resolution. |
| Protocol default behavior `lib/llm/src/protocols/openai.rs` | Replaces `get_ignore_eos` default with a call to `choose_with_deprecation`, preserving CommonExt precedence and deprecation-aware handling. |

Sequence Diagram(s)

sequenceDiagram
  autonumber
  actor Caller as Provider Getter
  participant CE as CommonExt (root)
  participant NV as NvExt
  participant Helper as choose_with_deprecation()
  participant Log as Deprecation Logger

  Caller->>Helper: resolve(field, CE.value?, NV.value?)
  alt NV has value
    Helper->>Log: emit_nvext_deprecation_warning(field, nv=Some, common=CE.is_some())
  end
  alt CE has value
    Helper-->>Caller: return CE.value
  else NV has value
    Helper-->>Caller: return NV.value
  else none
    Helper-->>Caller: return None
  end
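
The diagram above has two independent branches (warn or not, and which value to return), which gives four input combinations. One way to spell those out is the illustrative enum below; `Resolution` and its variant names are invented for this sketch and do not appear in the PR.

```rust
/// The four input combinations the diagram distinguishes (illustrative names).
#[derive(Debug, PartialEq)]
pub enum Resolution<T> {
    /// Only the top-level value is set: it is returned, no warning.
    TopLevel(T),
    /// Both are set: the top-level value is returned and a warning is emitted.
    TopLevelOverridesNvext(T),
    /// Only `nvext` is set: the `nvext` value is returned and a warning is emitted.
    NvextFallback(T),
    /// Neither is set: `None` is returned, no warning.
    Unset,
}

impl<T> Resolution<T> {
    /// Classify a (top-level, nvext) pair into one of the four cases.
    pub fn classify(common: Option<T>, nv: Option<T>) -> Self {
        match (common, nv) {
            (Some(c), Some(_)) => Resolution::TopLevelOverridesNvext(c),
            (Some(c), None) => Resolution::TopLevel(c),
            (None, Some(n)) => Resolution::NvextFallback(n),
            (None, None) => Resolution::Unset,
        }
    }

    /// True exactly when a value was supplied under `nvext`.
    pub fn warns(&self) -> bool {
        matches!(
            self,
            Resolution::TopLevelOverridesNvext(_) | Resolution::NvextFallback(_)
        )
    }
}
```

The warning depends only on whether `nvext` carried a value, not on which value ends up being returned.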

sequenceDiagram
  autonumber
  actor API as Request (NvCreate*Request)
  participant Provider as OpenAIStopConditionsProvider
  participant Helper as choose_with_deprecation()
  note over API,Provider: Compute effective ignore_eos
  API->>Provider: get_ignore_eos()
  Provider->>Helper: ("ignore_eos", common_ignore_eos?, nvext_ignore_eos?)
  Helper-->>Provider: Option<bool>
  Provider-->>API: Option<bool>
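
As a rough sketch of the flow in this second diagram, the example below resolves `ignore_eos` on a simplified request. The `Request` and `NvExt` structs are stand-ins with a single field each (the real request types carry many more), and `eprintln!` replaces the project's logger.

```rust
/// Simplified stand-in for the nested extension block.
#[derive(Default)]
pub struct NvExt {
    pub ignore_eos: Option<bool>,
}

/// Simplified stand-in for a request; `ignore_eos` is the top-level value.
#[derive(Default)]
pub struct Request {
    pub ignore_eos: Option<bool>,
    pub nvext: Option<NvExt>,
}

impl Request {
    /// Effective `ignore_eos`: top-level first, then the deprecated `nvext` value.
    pub fn get_ignore_eos(&self) -> Option<bool> {
        // `nvext` itself is optional, so flatten the two layers of `Option`.
        let nested = self.nvext.as_ref().and_then(|nv| nv.ignore_eos);
        if nested.is_some() {
            eprintln!("DEPRECATION WARNING: 'nvext.ignore_eos' is deprecated; use top-level 'ignore_eos'");
        }
        self.ignore_eos.or(nested)
    }
}
```

A request that omits `nvext` entirely and one that sends an empty `nvext` object behave the same way: neither triggers the warning.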

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

feat: Add frontend support for min_tokens and ignore_eos (outside of nvext) and Structured Output / Guided Decoding #2380 — Earlier changes to CommonExt/NvExt handling and ignore_eos; this PR layers deprecation-aware resolution atop that logic.
chore: guided decoding support for nvext #2339 — Introduced guided_decoding fields and extraction; this PR refactors selection to use a shared deprecation-aware helper.

Poem

A rabbit taps keys with delicate paws,
Choosing roots over branches, obeying the laws.
If NvExt whispers, a warning will chime—
Common comes first, every single time.
Stop at EOS? Or gently ignore?
Now one helper decides—clean, to the core. 🐇✨

Signed-off-by: Ryan Lempka <rlempka@nvidia.com>

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (8)

lib/llm/src/protocols/openai/common_ext.rs (3)

70-85: Centralized deprecation warnings: add a tracing target for easy filtering

Recommend adding a target so downstreams can silence or route these warnings.

-        tracing::warn!(
+        tracing::warn!(target: "deprecations",
             "DEPRECATION WARNING: 'nvext.{field_name}' is deprecated and will be removed in a future release. Use '{field_name}' at the top level or in 'extra_body' instead."
         );
...
-        tracing::warn!(
+        tracing::warn!(target: "deprecations",
             "DEPRECATION WARNING: 'nvext.{field_name}' is deprecated and will be removed in a future release. Top-level '{field_name}' takes precedence. Use '{field_name}' at the top level or in 'extra_body' instead."
         );

87-98: Helper looks solid; consider a ref variant to unify guided_json handling

Adding a reference-returning variant avoids bespoke code paths for ref-based getters.

 pub fn choose_with_deprecation<T: Clone>(
@@
     common.cloned().or_else(|| nv.cloned())
 }
+
+/// Reference-returning variant for fields whose getters return references.
+pub fn choose_ref_with_deprecation<'a, T>(
+    field: &'static str,
+    common: Option<&'a T>,
+    nv: Option<&'a T>,
+) -> Option<&'a T> {
+    if nv.is_some() {
+        emit_nvext_deprecation_warning(field, true, common.is_some());
+    }
+    common.or(nv)
+}
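
A practical difference with the suggested reference-returning variant is that it needs no `Clone` bound. The sketch below reproduces the suggestion so that it compiles on its own and exercises it with a hypothetical non-`Clone` payload type, `Schema`; the logger is a stand-in that uses `eprintln!` rather than `tracing::warn!`.

```rust
/// Stand-in for the PR's logger, which uses `tracing::warn!`.
fn emit_nvext_deprecation_warning(field: &str, _nv_set: bool, common_set: bool) {
    eprintln!("'nvext.{field}' is deprecated (top-level value set: {common_set})");
}

/// The reviewer's suggested helper, copied here so the sketch is self-contained.
pub fn choose_ref_with_deprecation<'a, T>(
    field: &'static str,
    common: Option<&'a T>,
    nv: Option<&'a T>,
) -> Option<&'a T> {
    if nv.is_some() {
        emit_nvext_deprecation_warning(field, true, common.is_some());
    }
    // `Option<&T>` is `Copy`, so `or` selects a reference without cloning the payload.
    common.or(nv)
}

/// Hypothetical payload type that deliberately does not implement `Clone`.
#[derive(Debug, PartialEq)]
pub struct Schema(pub String);
```

The returned reference borrows from whichever input was selected, which is why both inputs share the lifetime `'a`.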

195-213: Unit tests for precedence — LGTM

Covers all three cases (common wins, nvext fallback, both None). Consider (optional) a log-assert test in a follow-up if you want to guarantee deprecation emits.
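
One possible shape for such a log-assert test, offered only as a sketch: route the warning through a caller-supplied sink so a test can capture it. `choose_with_sink` is a hypothetical variant invented here; the PR's helper logs directly and takes no sink argument.

```rust
/// Hypothetical variant of the helper that reports warnings through a
/// caller-supplied sink instead of a global logger, so tests can inspect them.
pub fn choose_with_sink<T: Clone>(
    field: &str,
    common: Option<&T>,
    nv: Option<&T>,
    sink: &mut dyn FnMut(String),
) -> Option<T> {
    if nv.is_some() {
        sink(format!("'nvext.{field}' is deprecated; use top-level '{field}'"));
    }
    common.cloned().or_else(|| nv.cloned())
}
```

A test would pass a closure that pushes each message into a `Vec<String>` and then assert on that vector's length and contents.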

lib/llm/src/protocols/openai.rs (1)

64-68: Default get_ignore_eos centralization — LGTM; remove redundant per-type overrides

This default covers all providers. You can drop identical overrides in completions.rs and chat_completions.rs to reduce duplication.
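
The suggestion relies on ordinary trait default methods: implementors that supply only the accessors inherit the shared resolution. The sketch below shows the idea with invented names (`StopConditionsProvider`, `CompletionRequest`, `ChatRequest`, and their fields are simplified stand-ins, not the project's actual types), and it omits the deprecation warning for brevity.

```rust
/// Simplified stand-in for the provider trait.
pub trait StopConditionsProvider {
    fn common_ignore_eos(&self) -> Option<bool>;
    fn nvext_ignore_eos(&self) -> Option<bool>;

    /// Default resolution shared by every implementor: top-level first.
    fn get_ignore_eos(&self) -> Option<bool> {
        self.common_ignore_eos().or(self.nvext_ignore_eos())
    }
}

pub struct CompletionRequest {
    pub top: Option<bool>,
    pub nested: Option<bool>,
}

pub struct ChatRequest {
    pub top: Option<bool>,
    pub nested: Option<bool>,
}

// Neither impl overrides `get_ignore_eos`, so both use the trait default.
impl StopConditionsProvider for CompletionRequest {
    fn common_ignore_eos(&self) -> Option<bool> {
        self.top
    }
    fn nvext_ignore_eos(&self) -> Option<bool> {
        self.nested
    }
}

impl StopConditionsProvider for ChatRequest {
    fn common_ignore_eos(&self) -> Option<bool> {
        self.top
    }
    fn nvext_ignore_eos(&self) -> Option<bool> {
        self.nested
    }
}
```

With a single default, a later change to the precedence rule only has to be made in one place.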

lib/llm/src/protocols/openai/completions.rs (2)

148-153: guided_json: correct precedence and warning; consider using ref helper if added

If you add choose_ref_with_deprecation, you can simplify this to one call.

-        // Note: This one needs special handling since it returns a reference
-        if let Some(nvext) = &self.nvext
-            && nvext.guided_json.is_some()
-        {
-            emit_nvext_deprecation_warning("guided_json", true, self.common.guided_json.is_some());
-        }
-        self.common
-            .guided_json
-            .as_ref()
-            .or_else(|| self.nvext.as_ref().and_then(|nv| nv.guided_json.as_ref()))
+        choose_ref_with_deprecation(
+            "guided_json",
+            self.common.guided_json.as_ref(),
+            self.nvext.as_ref().and_then(|nv| nv.guided_json.as_ref()),
+        )

218-226: Redundant get_ignore_eos override; rely on trait default

Same logic as the default in openai.rs; recommend removing to avoid drift.

-    /// Get the effective ignore_eos value, considering both CommonExt and NvExt.
-    /// CommonExt (root-level) takes precedence over NvExt.
-    fn get_ignore_eos(&self) -> Option<bool> {
-        choose_with_deprecation(
-            "ignore_eos",
-            self.get_common_ignore_eos().as_ref(),
-            NvExtProvider::nvext(self).and_then(|nv| nv.ignore_eos.as_ref()),
-        )
-    }

lib/llm/src/protocols/openai/chat_completions.rs (2)

154-159: guided_json: correct precedence and warning; consider ref helper to reduce bespoke code

Mirrors the completions path; can be simplified with choose_ref_with_deprecation if added.

-        // Note: This one needs special handling since it returns a reference
-        if let Some(nvext) = &self.nvext
-            && nvext.guided_json.is_some()
-        {
-            emit_nvext_deprecation_warning("guided_json", true, self.common.guided_json.is_some());
-        }
-        self.common
-            .guided_json
-            .as_ref()
-            .or_else(|| self.nvext.as_ref().and_then(|nv| nv.guided_json.as_ref()))
+        choose_ref_with_deprecation(
+            "guided_json",
+            self.common.guided_json.as_ref(),
+            self.nvext.as_ref().and_then(|nv| nv.guided_json.as_ref()),
+        )

246-251: Redundant get_ignore_eos override; rely on trait default

Matches the default implementation; safe to remove.

-    /// Get the effective ignore_eos value, considering both CommonExt and NvExt.
-    /// CommonExt (root-level) takes precedence over NvExt.
-    fn get_ignore_eos(&self) -> Option<bool> {
-        choose_with_deprecation(
-            "ignore_eos",
-            self.get_common_ignore_eos().as_ref(),
-            NvExtProvider::nvext(self).and_then(|nv| nv.ignore_eos.as_ref()),
-        )
-    }

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 9e6972a and 618c96d.

📒 Files selected for processing (4)

lib/llm/src/protocols/openai.rs (2 hunks)
lib/llm/src/protocols/openai/chat_completions.rs (3 hunks)
lib/llm/src/protocols/openai/common_ext.rs (2 hunks)
lib/llm/src/protocols/openai/completions.rs (3 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-08-22T19:55:41.608Z

Learnt from: nachiketb-nvidia
PR: ai-dynamo/dynamo#2656
File: lib/llm/src/protocols/openai/chat_completions/delta.rs:320-327
Timestamp: 2025-08-22T19:55:41.608Z
Learning: There are two separate DeltaGenerator classes in the codebase: one for chat completions (lib/llm/src/protocols/openai/chat_completions/delta.rs with object "chat.completion.chunk") and one for text completions (lib/llm/src/protocols/openai/completions/delta.rs with object "text_completion"). They have different create_choice method signatures and serve different OpenAI API endpoints. The reasoning parsing functionality is only relevant to the chat completions DeltaGenerator.

Applied to files:

lib/llm/src/protocols/openai/completions.rs
lib/llm/src/protocols/openai/chat_completions.rs

🧬 Code graph analysis (3)

lib/llm/src/protocols/openai.rs (3)

lib/llm/src/protocols/openai/chat_completions.rs (1)

common_ext (148-150)

lib/llm/src/protocols/openai/common_ext.rs (2)

common_ext (60-60)

choose_with_deprecation (88-97)

lib/llm/src/protocols/openai/completions.rs (1)

common_ext (142-144)

lib/llm/src/protocols/openai/completions.rs (2)

lib/llm/src/protocols/openai/common_ext.rs (3)

common_ext (60-60)

choose_with_deprecation (88-97)

emit_nvext_deprecation_warning (71-85)

lib/llm/src/protocols/openai.rs (3)

nvext (43-43)

nvext (53-53)

get_ignore_eos (63-69)

lib/llm/src/protocols/openai/chat_completions.rs (2)

lib/llm/src/protocols/openai/common_ext.rs (7)

common_ext (60-60)

choose_with_deprecation (88-97)

emit_nvext_deprecation_warning (71-85)

get_guided_regex (64-64)

get_guided_grammar (65-65)

get_guided_choice (66-66)

get_guided_decoding_backend (67-67)

lib/llm/src/protocols/openai/completions.rs (9)

common_ext (142-144)

nvext (88-90)

nvext (136-138)

nvext (210-212)

get_guided_regex (160-166)

get_guided_grammar (168-176)

get_guided_choice (178-184)

get_guided_decoding_backend (186-194)

get_ignore_eos (220-226)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)

GitHub Check: Build and Test - dynamo
GitHub Check: pre-merge-rust (lib/bindings/python)
GitHub Check: pre-merge-rust (.)

🔇 Additional comments (13)

lib/llm/src/protocols/openai.rs (3)

11-11: Import of deprecation helper — LGTM

146-168: StopConditions extraction uses unified ignore_eos — LGTM

Precedence and validation are preserved.

62-68: All deprecation helpers correctly applied

Every get_guided_* getter in both completions.rs (§160–188) and chat_completions.rs (§166–194) uses the choose_with_deprecation helper.

Both get_guided_json implementations emit an NV-ext deprecation warning via emit_nvext_deprecation_warning.

The get_ignore_eos method in openai.rs dispatches through choose_with_deprecation for the “ignore_eos” field.

No further changes required.

lib/llm/src/protocols/openai/completions.rs (5)

27-29: Scoped imports for helpers — LGTM

160-166: guided_regex via helper — LGTM

168-176: guided_grammar via helper — LGTM

178-184: guided_choice via helper — LGTM

186-194: guided_decoding_backend via helper — LGTM

lib/llm/src/protocols/openai/chat_completions.rs (5)

24-26: Scoped imports for helpers — LGTM

167-171: guided_regex via helper — LGTM

175-181: guided_grammar via helper — LGTM

185-189: guided_choice via helper — LGTM

193-199: guided_decoding_backend via helper — LGTM

ryan-lempka requested review from grahamking, nnshah1 and paulhendricks August 28, 2025 00:04

ryan-lempka requested a review from a team as a code owner August 28, 2025 00:04

pull-request-size bot added the size/L label Aug 28, 2025

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 00:04 Inactive

ryan-lempka self-assigned this Aug 28, 2025

github-actions bot added the chore label Aug 28, 2025

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 00:05 Inactive

ryan-lempka force-pushed the rlempka/deprecate-duplicate-params-nvext branch from 152a9b1 to 6c5a9da Compare August 28, 2025 00:06

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 00:06 Inactive

chore: deprecate duplicate params in nvext

618c96d

Signed-off-by: Ryan Lempka <rlempka@nvidia.com>

ryan-lempka force-pushed the rlempka/deprecate-duplicate-params-nvext branch from 6c5a9da to 618c96d Compare August 28, 2025 00:10

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 00:10 Inactive

copy-pr-bot bot temporarily deployed to GITLAB August 28, 2025 00:16 Inactive

coderabbitai bot reviewed Aug 28, 2025

paulhendricks approved these changes Aug 28, 2025

ryan-lempka merged commit e3619ce into main Aug 28, 2025
15 of 16 checks passed

ryan-lempka deleted the rlempka/deprecate-duplicate-params-nvext branch August 28, 2025 04:25

This was referenced Aug 28, 2025

chore: deprecate nvext.top_k and nvext.repetition_penalty and make available top level #2767

Merged

[FEATURE]: Remove duplicate parameters under nvext in 0.6.0 #2781

Closed

jasonqinzhou pushed a commit that referenced this pull request Aug 30, 2025

chore: deprecate duplicate params in nvext (#2754)

9e6f472

Signed-off-by: Ryan Lempka <rlempka@nvidia.com> Signed-off-by: Jason Zhou <jasonzho@jasonzho-mlt.client.nvidia.com>

michaelshin pushed a commit that referenced this pull request Sep 2, 2025

chore: deprecate duplicate params in nvext (#2754)

937b968

Signed-off-by: Ryan Lempka <rlempka@nvidia.com> Signed-off-by: Michael Shin <michaelshin@users.noreply.github.com>

KrishnanPrash pushed a commit that referenced this pull request Sep 2, 2025

chore: deprecate duplicate params in nvext (#2754)

e6b0e21

Signed-off-by: Ryan Lempka <rlempka@nvidia.com> Signed-off-by: Krishnan Prashanth <kprashanth@nvidia.com>

nnshah1 pushed a commit that referenced this pull request Sep 8, 2025

chore: deprecate duplicate params in nvext (#2754)

1ab5151

Signed-off-by: Ryan Lempka <rlempka@nvidia.com> Signed-off-by: nnshah1 <neelays@nvidia.com>

coderabbitai bot mentioned this pull request Sep 11, 2025

feat: add chat_template_kwargs param to v1/chat/completion #3016

Merged
