openapi gen return type fix for streaming/non-streaming #910

yanxi0830 · 2025-01-31T00:54:42Z

What does this PR do?

We need to change

/v1/inference/chat-completion:
    post:
      responses:
        '200':
          description: >-
            If stream=False, returns a ChatCompletionResponse with the full completion.
            If stream=True, returns an SSE event stream of ChatCompletionResponseStreamChunk
          content:
            text/event-stream:
              schema:
                oneOf:
                  - $ref: '#/components/schemas/ChatCompletionResponse'
                  - $ref: '#/components/schemas/ChatCompletionResponseStreamChunk'

into

/v1/inference/chat-completion:
    post:
      responses:
        '200':
          description: >-
            If stream=False, returns a ChatCompletionResponse with the full completion.
            If stream=True, returns an SSE event stream of ChatCompletionResponseStreamChunk
          content:
            text/event-stream:
              schema:
                $ref: '#/components/schemas/ChatCompletionResponseStreamChunk'
            application/json:
              schema:
                $ref: '#/components/schemas/ChatCompletionResponse'

Test Plan

Python

tested in SDK sync: Fix streaming/non-streaming return type differences llama-stack-client-python#108

Node

tested w/ https://gist.github.com/yanxi0830/b782f4b91e21dcccdfef8898ce55157e (SDK udpate follow up)

Sources

Please link relevant resources if necessary.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Ran pre-commit to handle lint / formatting issues.
Read the contributor guideline,
Pull Request section?
Updated relevant documentation.
Wrote necessary unit or integration tests.

ashwinb

nice

terrytangyuan

LGTM. Should we run these SDK tests on a schedule?

yanxi0830 · 2025-01-31T01:58:35Z

LGTM. Should we run these SDK tests on a schedule?

Not on schedule currently, but @sixianyi0721 added workflow that you can manually trigger by commit here: https://github.com/meta-llama/llama-stack/blob/main/.github/workflows/tests.yml

# What does this PR do? - sync with llamastack/llama-stack#910 ## Test Plan LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_inference.py LLAMA_STACK_CONFIG="./llama_stack/templates/fireworks/run.yaml" pytest -v tests/inference/test_inference.py LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/agents/test_agents.py::test_agent_simple ## Sources Please link relevant resources if necessary. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [ ] Ran pre-commit to handle lint / formatting issues. - [ ] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [ ] Wrote necessary unit or integration tests.

fix gen streaming

b94d8e9

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 31, 2025

yanxi0830 added 3 commits January 30, 2025 16:55

Merge branch 'main' into fix-stream-non-stream

7bed288

openapi

1fd7ea8

generator

4291fc0

yanxi0830 mentioned this pull request Jan 31, 2025

Fix streaming/non-streaming return type differences llamastack/llama-stack-client-python#108

Merged

5 tasks

yanxi0830 marked this pull request as ready for review January 31, 2025 01:19

yanxi0830 requested review from dineshyv, dltn, hardikjshah, raghotham, sixianyi0721 and vladimirivic as code owners January 31, 2025 01:19

ashwinb approved these changes Jan 31, 2025

View reviewed changes

terrytangyuan approved these changes Jan 31, 2025

View reviewed changes

Merge branch 'main' into fix-stream-non-stream

9a6aac4

yanxi0830 merged commit 15dcc4e into main Jan 31, 2025
2 checks passed

yanxi0830 deleted the fix-stream-non-stream branch January 31, 2025 02:03

This was referenced Jan 31, 2025

update node sdk with streaming/non-streaming return type fix llamastack/llama-stack-client-typescript#4

Closed

update node sdk with streaming/non-streaming return type fix llamastack/llama-stack-client-typescript#5

Merged

vladimirivic mentioned this pull request Jan 31, 2025

Sync updates from stainless branch: main llamastack/llama-stack-client-python#115

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

openapi gen return type fix for streaming/non-streaming #910

openapi gen return type fix for streaming/non-streaming #910

Uh oh!

yanxi0830 commented Jan 31, 2025 •

edited

Loading

Uh oh!

ashwinb left a comment

Uh oh!

terrytangyuan left a comment

Uh oh!

yanxi0830 commented Jan 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

openapi gen return type fix for streaming/non-streaming #910

openapi gen return type fix for streaming/non-streaming #910

Uh oh!

Conversation

yanxi0830 commented Jan 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Plan

Sources

Before submitting

Uh oh!

ashwinb left a comment

Choose a reason for hiding this comment

Uh oh!

terrytangyuan left a comment

Choose a reason for hiding this comment

Uh oh!

yanxi0830 commented Jan 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

yanxi0830 commented Jan 31, 2025 •

edited

Loading