fix(nvidia-safety): correct NeMo Guardrails API endpoint #4202

r-bit-rry · 2025-11-20T11:23:14Z

This PR fixes issue #4189 where the NVIDIA safety provider was calling an incorrect API endpoint when communicating with NeMo Guardrails service.

Problem

The NVIDIA safety provider implementation was calling /v1/guardrail/checks, which does not exist in the NeMo Guardrails API. According to the NeMo Guardrails documentation and nvidia docs, the correct endpoint is /v1/chat/completions.

This caused:

500 Internal Server Error when using NVIDIA safety shields
Complete failure of safety filtering functionality
Inability to use NeMo Guardrails for content moderation

Solution

1. Fixed Endpoint (nvidia.py:144)

Before:

response = await self._guardrails_post(path="/v1/guardrail/checks", data=request_data)

After:

response = await self._guardrails_post(path="/v1/chat/completions", data=request_data)

2. Simplified Request Format (nvidia.py:140-143)

Before:

request_data = {
    "model": self.model,
    "messages": [{"role": message.role, "content": message.content} for message in messages],
    "temperature": self.temperature,
    "top_p": 1,
    "frequency_penalty": 0,
    "presence_penalty": 0,
    "max_tokens": 160,
    "stream": False,
    "guardrails": {
        "config_id": self.config_id,
    },
}

After:

request_data = {
    "config_id": self.config_id,
    "messages": [{"role": message.role, "content": message.content} for message in messages],
}

The simplified format matches the NeMo Guardrails API specification and removes unnecessary inference parameters that were meant for LLM completion, not safety checks.

Testing

Test Results

✅ All 10 tests passing:

Unit Tests (8/8):
  ✓ test_register_shield_with_valid_id
  ✓ test_register_shield_without_id
  ✓ test_run_shield_allowed
  ✓ test_run_shield_blocked
  ✓ test_run_shield_not_found
  ✓ test_run_shield_http_error
  ✓ test_init_nemo_guardrails
  ✓ test_init_nemo_guardrails_invalid_temperature

E2E Tests (2/2) (not part of this PR):
  ✓ test_nvidia_safety_with_correct_endpoint
  ✓ test_nemo_guardrails_api_endpoint_documentation

Manual Verification

The reproduction script demonstrates the fix:

$ python tests/integration/safety/reproduce_issue_4189.py

✓ SUCCESS: Issue #4189 has been FIXED!
  Response received without errors
  Violation detected: True
  Violation level: ViolationLevel.ERROR
  User message: Sorry I cannot do this.

✓ All tests passing!
  - Blocked messages are detected
  - Safe messages are allowed
  - No 500 errors from wrong endpoint

Validation Against NeMo Guardrails API

This fix aligns with the official NeMo Guardrails API specification:

Endpoint: POST /v1/chat/completions

Request Format:

{
  "config_id": "demo-config",
  "messages": [
    {
      "role": "user",
      "content": "Hello!"
    }
  ]
}

Response Format:

{
  "role": "assistant",
  "content": "Response text",
  "status": "allowed|blocked",
  "rails_status": {
    "reason": "...",
    "triggered_rails": [...]
  }
}

Breaking Changes

None. This is a bug fix that makes the implementation work as originally intended.

References

Issue: NVIDIA Safety Provider Calling Wrong Guardrails Endpoint #4189
NeMo Guardrails: https://github.com/NVIDIA/NeMo-Guardrails
Related File: src/llama_stack/providers/remote/safety/nvidia/nvidia.py

ashwinb · 2025-11-20T20:08:31Z

where is the mentioned reproduction script? Either this could be included as a link to a Gist, or better still -- the PR summary should perhaps either be hand edited to look more like what a human engineer would write (just do not include menial details which are irrelevant for a change of such small magnitude) or the LLM could be guided with this precise instruction so the generated Summary feels like that.

r-bit-rry · 2025-11-23T14:05:30Z

where is the mentioned reproduction script? Either this could be included as a link to a Gist, or better still -- the PR summary should perhaps either be hand edited to look more like what a human engineer would write (just do not include menial details which are irrelevant for a change of such small magnitude) or the LLM could be guided with this precise instruction so the generated Summary feels like that.

Thanks for the reply, I went ahead and did a further simulation vs local instances of nemo, and ollama backend to identify more issues with the guardrails calls and discrepancies vs the original nvidia documentation.
I'll add several more changes and the testing scripts via gist (as they require setting up environment with docker and are not part of the general infrastructure of llama-stack)

r-bit-rry · 2025-11-24T13:00:34Z

@ashwinb
The /v1/guardrail/checks endpoint does not exist in the current NeMo Guardrails codebase.

The server API implementation only includes two main endpoints:
Actual Endpoints
/v1/rails/configs - Returns the list of available guardrails configurations api.py:277-303

/v1/chat/completions - The main endpoint for chat completions with guardrails applied api.py:369-374

This contradicts the official documentation I mentioned earlier here.
I'm going to run a few more checks to see that the API truly behaves the way its expected, but I don't like the discrepancy with the official documentation, I'm going to run things through with the original writer of the issue to see if he notices similar behavior on his instances

mattf · 2025-11-24T14:07:12Z

@jiayin-nvidia @rmkraus can you shed some light on this?

r-bit-rry · 2025-11-24T20:10:14Z

Direct API calls were made to the running container to compare the behavior of the endpoint used by the code (/v1/chat/completions) versus the endpoint documented for checks (/v1/guardrail/checks).
The container that was used was only of nemo-guardrails, the original issue uses the openshift nim-operator, but after further inspection I observed there was no code to redirect or modify the body, and the original documentation for the NeMo Microservices redirects to the same guardrails documentation.

Documentation vs. Implementation: The changes do not match the online NVIDIA NeMo Microservices documentation (v25.11.0). The documentation specifies endpoints like /v1/guardrail/checks and nested guardrails configuration, whereas the code uses /v1/chat/completions and top-level config_id.
Implementation vs. Reality: The changes do match the actual behavior of the NeMo Guardrails server code and container images (nemoguardrails:latest).
Source Code Verification: Using DeepWiki to inspect nemoguardrails/server/api.py confirmed that:
The RequestBody class explicitly accepts config_id at the top level.
The ResponseBody class returns a messages list, not the OpenAI choices format.

Test 1: Code-Usage Endpoint (/v1/chat/completions)

Request:

POST http://localhost:8000/v1/chat/completions
Content-Type: application/json

{
  "config_id": "demo-self-check-input-output",
  "messages": [
    {
      "role": "user",
      "content": "You are stupid"
    }
  ]
}

Response:

Status: 200 OK
Body: {"messages":[{"role":"assistant","content":""}]}
The server accepted the request with a top-level config_id and returned an empty content string for the violation.

Test 2: Documented Check Endpoint (/v1/guardrail/checks)

Request:

POST http://localhost:8000/v1/guardrail/checks
Content-Type: application/json

{
  "guardrails": {
    "config_id": "demo-self-check-input-output"
  },
  "messages": [
    {
      "role": "user",
      "content": "You are stupid"
    }
  ]
}

Response:

Status: 405 Method Not Allowed
Body: {"detail":"Method Not Allowed"}

fix(nvidia-safety): correct NeMo Guardrails API endpoint

1458e88

r-bit-rry requested review from ashwinb, bbrowning, ehhuang, franciscojavierarceo, leseb, mattf and raghotham as code owners November 20, 2025 11:23

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 20, 2025

Merge branch 'main' into fix/nvidia-safety-provider-endpoint-4189

f8f2834

Merge branch 'main' into fix/nvidia-safety-provider-endpoint-4189

fedfcf9

r-bit-rry requested a review from cdoern as a code owner November 23, 2025 11:13

r-bit-rry and others added 3 commits November 24, 2025 22:17

further changes to nvidia.py

c580bae

Merge branch 'main' into fix/nvidia-safety-provider-endpoint-4189

8bd515b

precommit hook check and fixes

4d66041

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(nvidia-safety): correct NeMo Guardrails API endpoint #4202

fix(nvidia-safety): correct NeMo Guardrails API endpoint #4202

r-bit-rry commented Nov 20, 2025

Uh oh!

ashwinb commented Nov 20, 2025 •

edited

Loading

Uh oh!

r-bit-rry commented Nov 23, 2025

Uh oh!

r-bit-rry commented Nov 24, 2025

Uh oh!

mattf commented Nov 24, 2025

Uh oh!

r-bit-rry commented Nov 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix(nvidia-safety): correct NeMo Guardrails API endpoint #4202

Are you sure you want to change the base?

fix(nvidia-safety): correct NeMo Guardrails API endpoint #4202

Conversation

r-bit-rry commented Nov 20, 2025

Problem

Solution

1. Fixed Endpoint (nvidia.py:144)

2. Simplified Request Format (nvidia.py:140-143)

Testing

Test Results

Manual Verification

Validation Against NeMo Guardrails API

Breaking Changes

References

Uh oh!

ashwinb commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

r-bit-rry commented Nov 23, 2025

Uh oh!

r-bit-rry commented Nov 24, 2025

Uh oh!

mattf commented Nov 24, 2025

Uh oh!

r-bit-rry commented Nov 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ashwinb commented Nov 20, 2025 •

edited

Loading