feat: Add responses and safety impl extra_body #3781

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

slekkala1 merged 18 commits into main from new-responses-and-safety

Oct 15, 2025

Contributor

slekkala1 commented Oct 10, 2025 •

edited

Loading

What does this PR do?

Have closed the previous PR due to merge conflicts with multiple PRs
Addressed all comments from #3768 (sorry for carrying over to this one)

Test Plan

Added UTs and integration tests

meta-cla bot added the CLA Signed label

slekkala1 marked this pull request as ready for review

October 10, 2025 22:25

slekkala1 requested review from ashwinb, bbrowning, ehhuang, franciscojavierarceo, hardikjshah, leseb, mattf, raghotham, reluctantfuturist, terrytangyuan and yanxi0830 as code owners

October 10, 2025 22:25

ashwinb reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/openai_responses.py Outdated Show resolved Hide resolved

slekkala1 force-pushed the new-responses-and-safety branch 2 times, most recently from 76f0478 to 76b991c Compare

October 13, 2025 19:13

ehhuang reviewed

View reviewed changes

llama_stack/apis/agents/agents.py

    
                  :param type: The type/identifier of the guardrail.

                  """

                  type: str

Contributor

ehhuang Oct 14, 2025

just for my learning: what types are available?

Contributor Author

slekkala1 Oct 14, 2025

not sure about this part @ashwinb, I usually only know identifier for a shield, as such only supporting that for now. May be this is to allow more fields in future.

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/agents.py Outdated Show resolved Hide resolved

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/utils.py Show resolved Hide resolved

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/utils.py

    
                      if isinstance(guardrail, str):

                          guardrail_ids.append(guardrail)

                      elif isinstance(guardrail, ResponseGuardrailSpec):

                          guardrail_ids.append(guardrail.type)

Contributor

ehhuang Oct 14, 2025

this seems confusing: type being used as id. Is there a better way to name this?

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/utils.py Outdated Show resolved Hide resolved

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py Outdated Show resolved Hide resolved

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py

    
                      # Input safety validation - check messages before processing

                      if self.guardrail_ids:

                          combined_text = interleaved_content_as_str([msg.content for msg in self.ctx.messages])

Contributor

ehhuang Oct 14, 2025

should we document somewhere that guardrails only apply to text input?

Contributor Author

slekkala1 Oct 14, 2025 •

edited

Loading

yes the shield + moderation apis dont support the image, this is known tech debt, I filed an issue for that before.

Contributor

ehhuang Oct 15, 2025

Is it in our user-facing documentation?

Contributor Author

slekkala1 Oct 15, 2025

yeah may be not, need to do that

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py Outdated Show resolved Hide resolved

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py Outdated Show resolved Hide resolved

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py Show resolved Hide resolved

slekkala1 force-pushed the new-responses-and-safety branch 4 times, most recently from a9ebdfe to 1f7d52f Compare

October 15, 2025 16:57

ehhuang reviewed

View reviewed changes

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py

    
                      # Input safety validation - check messages before processing

                      if self.guardrail_ids:

                          combined_text = interleaved_content_as_str([msg.content for msg in self.ctx.messages])

Contributor

ehhuang Oct 15, 2025

Is it in our user-facing documentation?

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py Outdated Show resolved Hide resolved

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py

    
                                      )

                              # Collect content for final response

                              chat_response_content.append(chunk_choice.delta.content or "")

Contributor

ehhuang Oct 15, 2025

[Re: line +624]

this should be gated too?

See this comment inline on Graphite.

Contributor Author

slekkala1 Oct 15, 2025

yes added gating for reasoning content too

llama_stack/providers/inline/agents/meta_reference/responses/streaming.py Outdated

    
                                      sequence_number=self.sequence_number,

                                  )

                                  # Skip Emitting text content delta event if guardrails are configured, only emits chunks after guardrails are applied

                                  if not self.guardrail_ids:

Contributor

ehhuang Oct 15, 2025

is it acceptable to drop these chunks entirely? or should we queue and yield after guardrails pass

Contributor Author

slekkala1 Oct 15, 2025 •

edited

Loading

Good point! Queuing and streaming the deltas when chunk is safe.

llama_stack/providers/inline/agents/meta_reference/responses/utils.py Outdated Show resolved Hide resolved

llama_stack/providers/inline/agents/meta_reference/responses/utils.py Outdated Show resolved Hide resolved

slekkala1 force-pushed the new-responses-and-safety branch from 6bd5b4d to 8d10642 Compare

October 15, 2025 20:17

slekkala1 added 18 commits

October 15, 2025 14:06


          feat: Add responses and safety impl extra_body

ad4362e


          clean and fix tests

171fb71


          use guardrails and run_moderation api

c10db23


          fix tests and remove unwanted changes

06dcfd1


          add recordings

aeb9e4b


          add recording again

495d233


          fix tests

da07772


          clean

b5c08c7


          improve user message

f8861bc


          fix test

74be622


          address comments

9a4d3d7


          fix tests

6e02802


          add explicit types

eb7dceb


          skip emitting deltas

fc960a3


          run pre-commit

31105c4


          queue and stream events for safe chunk

ada18ec


          fix tests

f6eb124


          remove unwanted recordings

94b5df7

slekkala1 force-pushed the new-responses-and-safety branch from 8c900d7 to 94b5df7 Compare

October 15, 2025 21:06

ehhuang approved these changes

View reviewed changes

slekkala1 merged commit 99141c2 into main

21 of 22 checks passed

slekkala1 deleted the new-responses-and-safety branch

October 15, 2025 22:01

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

ashwinb ashwinb left review comments

ehhuang ehhuang approved these changes

yanxi0830 Awaiting requested review from yanxi0830 yanxi0830 is a code owner

hardikjshah Awaiting requested review from hardikjshah hardikjshah is a code owner

raghotham Awaiting requested review from raghotham raghotham is a code owner

terrytangyuan Awaiting requested review from terrytangyuan terrytangyuan is a code owner

leseb Awaiting requested review from leseb leseb is a code owner

bbrowning Awaiting requested review from bbrowning bbrowning is a code owner

reluctantfuturist Awaiting requested review from reluctantfuturist reluctantfuturist is a code owner

mattf Awaiting requested review from mattf mattf is a code owner

franciscojavierarceo Awaiting requested review from franciscojavierarceo franciscojavierarceo is a code owner

Labels