
Conversation


@Pouyanpi Pouyanpi commented Oct 20, 2025

Description

Add custom HTTP headers support to the ChatNVIDIA class patch, enabling users to pass custom headers
(authentication tokens, request IDs, billing information, etc.) with all requests to NVIDIA AI endpoints.

Example config:

```yaml
models:
  - type: main
    engine: nvidia_ai_endpoints
    model: meta/llama-3.1-8b-instruct
    parameters:
      base_url: http://localhost:8000/v1
      custom_headers:
        X-Model-Auth: test-bearer-token
        X-Request-ID: "12345"
        X-Billing-Project: project-abc
        X-Trace-ID: trace-xyz
```

The feature has been verified on the microservice side.

Implementation

  • Added an optional custom_headers field to the ChatNVIDIA class, with Pydantic v2 compatibility
  • Implemented runtime method wrapping that intercepts _client.get_req() and _client.get_req_stream()
    to merge custom headers with the existing request headers
  • Included automatic version detection to ensure compatibility with langchain-nvidia-ai-endpoints >=
    0.3.0, with clear error messages for older versions
  • Works with both synchronous invoke() and streaming requests, and is fully compatible with VLMs
    (Vision Language Models)
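The runtime method wrapping described above can be sketched roughly as follows. This is a simplified, hypothetical illustration, not the patch's actual code: `FakeClient` and `wrap_with_custom_headers` are stand-in names, the real patch wraps ChatNVIDIA's internal `_client.get_req()` and `_client.get_req_stream()`, and the assumption that custom headers take precedence over existing ones is mine.

```python
from functools import wraps


class FakeClient:
    """Stand-in for ChatNVIDIA's internal endpoints client (hypothetical)."""

    def get_req(self, payload, extra_headers=None):
        # The real client would issue an HTTP request; here we just
        # return the headers so the merge behavior is visible.
        return dict(extra_headers or {})


def wrap_with_custom_headers(client, custom_headers):
    """Wrap client.get_req so custom headers are merged into every call."""
    original = client.get_req

    @wraps(original)
    def wrapped(payload, extra_headers=None):
        # Default to None (not {}) to avoid the mutable-default pitfall,
        # and copy before merging so the caller's dict is never mutated.
        merged = dict(extra_headers or {})
        merged.update(custom_headers)  # assumed precedence: custom headers win
        return original(payload, extra_headers=merged)

    client.get_req = wrapped
    return client


client = wrap_with_custom_headers(
    FakeClient(), {"X-Model-Auth": "test-bearer-token", "X-Request-ID": "12345"}
)
headers = client.get_req({"prompt": "hi"}, extra_headers={"Accept": "application/json"})
# headers now contains Accept plus both custom headers
print(headers)
```

The same wrapping would be applied to the streaming counterpart (`get_req_stream`) so that invoke() and streaming calls behave identically.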

Checklist

  • I've read the CONTRIBUTING guidelines.
  • I've updated the documentation if applicable.
  • I've added tests if applicable.
  • @mentions of the person or team responsible for reviewing proposed changes.

@Pouyanpi Pouyanpi force-pushed the feat/custom-headers-nvidia branch from 0025970 to 4cb7771 Compare October 20, 2025 09:42

codecov-commenter commented Oct 20, 2025

Codecov Report

❌ Patch coverage is 66.66667% with 1 line in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| .../providers/_langchain_nvidia_ai_endpoints_patch.py | 66.66% | 1 Missing ⚠️ |


@greptile-apps greptile-apps bot left a comment

2 files reviewed, no comments

@Pouyanpi Pouyanpi force-pushed the feat/custom-headers-nvidia branch from a5f27cb to a88b954 Compare October 22, 2025 16:20
@Pouyanpi Pouyanpi marked this pull request as ready for review October 22, 2025 16:21
@trebedea trebedea left a comment


Everything is good, just fix the RuntimeError message.


@greptile-apps greptile-apps bot left a comment

Greptile Summary

This review covers only the changes made since the last review, not the entire PR. The latest changes address previous feedback by (1) fixing the mutable default argument bug where dict = {} was replaced with dict = None in both wrapped_get_req and wrapped_get_req_stream functions, and (2) consolidating redundant RuntimeError conditions that checked for langchain-nvidia-ai-endpoints compatibility. The developer appears to have removed duplicate version checks and simplified the error handling logic. These changes integrate with NeMo-Guardrails' existing LLM provider patching mechanism (found in nemoguardrails/llm/providers/) which allows runtime customization of third-party provider clients without modifying the original library code.

Important Files Changed

| Filename | Score | Overview |
| --- | --- | --- |
| nemoguardrails/llm/providers/_langchain_nvidia_ai_endpoints_patch.py | 1/5 | Fixed mutable default arguments but introduced critical duplicate super().__init__() calls and redundant nested if self.custom_headers checks |

Confidence score: 1/5

  • This PR contains a critical bug that will cause immediate runtime issues and should not be merged without fixes
  • Score reflects the duplicate super().__init__(**kwargs) call on lines 64 and 66, redundant nested if self.custom_headers checks on lines 65 and 73, and unclear control flow in the __init__ method that suggests merge conflicts or copy-paste errors were not fully resolved
  • Pay close attention to the __init__ method (lines 63-81) - the duplicate initialization and nested conditionals must be fixed before this can safely merge
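The mutable-default bug flagged above (replacing `dict = {}` with `dict = None`) is a classic Python pitfall: a mutable default argument is created once at function definition time and shared across all calls that omit it. A minimal illustration with hypothetical function names, not the patch's actual code:

```python
# Buggy: the default dict is created once, at function definition time,
# and every call that omits the argument mutates the same shared object.
def get_req_buggy(extra_headers={}):
    extra_headers["X-Call"] = "seen"
    return extra_headers


a = get_req_buggy()
b = get_req_buggy()
print(a is b)  # True: both calls returned the same shared dict


# Fixed: default to None and build a fresh dict on every call.
def get_req_fixed(extra_headers=None):
    headers = dict(extra_headers or {})
    headers["X-Call"] = "seen"
    return headers


c = get_req_fixed()
d = get_req_fixed()
print(c is d)  # False: each call gets its own dict
```

In a header-merging wrapper the shared dict would accumulate headers across unrelated requests, which is why the review insisted on the `None` default.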

1 file reviewed, 2 comments



@greptile-apps greptile-apps bot left a comment

Greptile Summary

This review covers only the changes made since the last review, not the entire PR. The developer has addressed previous feedback by removing duplicate initialization code in the ChatNVIDIA class patch. Specifically, they eliminated the redundant super().__init__(**kwargs) call that was appearing twice and removed a nested if self.custom_headers check that was already covered by an outer conditional. These changes streamline the initialization logic for the custom HTTP headers feature without altering its functionality. The custom headers support allows users to pass authentication tokens, request IDs, and other metadata through YAML configuration to NVIDIA AI endpoints by intercepting and wrapping the internal _client.get_req() and _client.get_req_stream() methods.

Important Files Changed

| Filename | Score | Overview |
| --- | --- | --- |
| nemoguardrails/llm/providers/_langchain_nvidia_ai_endpoints_patch.py | 4/5 | Removed duplicate super().__init__() call and redundant nested conditional check in ChatNVIDIA initialization |

Confidence score: 4/5

  • This PR is safe to merge with minimal risk as it removes obvious code duplication
  • Score reflects that the changes are purely cleanup with no functional modifications, though the overall feature implementation was not fully reviewed here
  • No files require special attention; this is a straightforward bug fix addressing previous review comments

1 file reviewed, 2 comments


@Pouyanpi Pouyanpi added this to the v0.18.0 milestone Oct 24, 2025
@Pouyanpi Pouyanpi self-assigned this Oct 24, 2025
@Pouyanpi Pouyanpi merged commit aafd733 into develop Oct 24, 2025
8 checks passed
@Pouyanpi Pouyanpi deleted the feat/custom-headers-nvidia branch October 24, 2025 12:55