
Conversation

@jhpiedrahitao (Contributor) commented Mar 12, 2025

What does this PR do?

Switch the sambanova inference adapter to LiteLLM to simplify integration and solve issues with the current adapter when streaming and tool calling; models and templates updated.
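For context, the OpenAI-compatible LiteLLM call the adapter now delegates to looks roughly like this (a sketch, not the adapter's verbatim code; the model id is taken from the test plan below):

    import litellm

    # Sketch: route a chat completion through LiteLLM's OpenAI-compatible
    # entry point instead of a hand-rolled SambaNova HTTP client.
    response = litellm.completion(
        model="sambanova/Meta-Llama-3.3-70B-Instruct",
        messages=[{"role": "user", "content": "Hello!"}],
        stream=False,
    )
    print(response.choices[0].message.content)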

Test Plan

pytest -s -v tests/integration/inference/test_text_inference.py --stack-config=sambanova --text-model=sambanova/Meta-Llama-3.3-70B-Instruct

pytest -s -v tests/integration/inference/test_vision_inference.py --stack-config=sambanova --vision-model=sambanova/Llama-3.2-11B-Vision-Instruct

@facebook-github-bot added the CLA Signed label Mar 12, 2025
@jhpiedrahitao changed the title from "feat(providers): sambanova updated to use LiteLLM openai-compat, also models and templates updated" to "feat(providers): sambanova updated to use LiteLLM openai-compat" Mar 12, 2025
@ashwinb (Contributor) left a comment:

lgtm

@ashwinb (Contributor) left a comment:

You have listed vision models in the supported models for sambanova also. Can your test plan show the outputs for executing the entire integration suite (pytest -s -v tests/integration) please?

@jhpiedrahitao (Author) commented Mar 13, 2025

Hi @ashwinb. I'll leave one comment for each test plan.

  • for test_text_inference:
    [screenshot: test_text_inference results]

here, 3 tests are failing:
1- chat_completion:ttft: error when empty tool_call in content
2- chat_completion:structured_output: SambaNova does not yet support structured output with json_schema
3- chat_completion:tool_calling_tools_absent-False]: error when empty tool_call in content

> The error when the tool_call in content is empty (1 and 3) is because the SambaNova API does not admit an empty tool call; this will be fixed at the API level soon
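For illustration, my reading of the failing shape in OpenAI-compatible terms (the values here are hypothetical; only the empty content string mirrors the failure):

    # Hypothetical assistant turn: the model answered only with a tool call,
    # so the accompanying content is empty, which the SambaNova API rejects.
    assistant_turn = {
        "role": "assistant",
        "content": "",  # empty content alongside a tool call
        "tool_calls": [
            {
                "id": "call_0",  # illustrative id
                "type": "function",
                "function": {"name": "get_weather", "arguments": "{}"},
            }
        ],
    }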

@jhpiedrahitao (Author) commented

  • for test_vision_inference:
    [screenshot: test_vision_inference results]

2 tests are failing; reasons:
1- the SambaNova API does not support sending a direct URL; it is required to always send the image in b64
2- when a turn in the conversation has more than one element in the content:

    message = {
        "role": "user",
        "content": [
            {
                "type": "image",
                "image": {
                    "url": {
                        "uri": "https://raw.githubusercontent.com/meta-llama/llama-stack/main/tests/api/inference/dog.png"
                    },
                },
            },
            {
                "type": "text",
                "text": "Describe what is in this image.",
            },
        ],
    }

instead of multiple messages each with its own content, llama-stack is failing; however, this works directly with LiteLLM:
[screenshot: multi-element content working directly with LiteLLM]

tested manually with separate messages in llama-stack, working:
[screenshot: separate messages working in llama-stack]
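For reference, the manual workaround splits the multi-element content into one message per element, roughly like this (a sketch; the exact shape llama-stack accepts may differ):

    # Sketch of the workaround: send one user message per content element
    # instead of a single message whose content is a list of mixed elements.
    messages = [
        {
            "role": "user",
            "content": {
                "type": "image",
                "image": {
                    "url": {
                        "uri": "https://raw.githubusercontent.com/meta-llama/llama-stack/main/tests/api/inference/dog.png"
                    },
                },
            },
        },
        {
            "role": "user",
            "content": "Describe what is in this image.",
        },
    ]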

in this one, I'm guessing it can be something odd in how the LiteLLM util is getting/sending the messages when there are multiple elements in the same content.

@jhpiedrahitao (Author) commented

Finally, remote::sambanova doesn't support embedding models yet.

@jhpiedrahitao requested a review from ashwinb March 13, 2025 15:24
@jhpiedrahitao (Author) commented Mar 18, 2025

@ashwinb I found the reason for some of the test failures (multimodal with multiple content-type elements inside content).

It looks like there is an issue when using the LiteLLM litellm_openai_mixin util for inference adapters in some cases (when the content of a message has multiple content items: image_urls, texts): each content item ends up wrapped inside its own list. This structure is problematic for LiteLLM, given that their implementation tries to look up the type with a .get("type"), but because the actual content-item dicts are inside lists we get this error:

[screenshot: LiteLLM error on .get("type")]

This happens because in providers.utils.inference.openai_compat::_convert_message_content, if the content is not a string or a list it is put into a list, and in the case of content items those are dicts, so as a result the content becomes a list of lists (each with only one dict: the content item). Do you know the reason for putting the content dicts into a list? Could we simply change the return to also return the element without putting it into a list if its type is dict?
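A minimal sketch reconstructing the described behavior (simplified; not the verbatim llama-stack code):

    # Simplified reconstruction of _convert_message_content as described above.
    def _convert_message_content(content):
        if isinstance(content, (str, list)):
            return content
        # A single content item is a dict, so it gets wrapped here; a message
        # whose content is a list of items therefore becomes a list of
        # single-element lists once each item passes through this function.
        return [content]

    items = [
        {"type": "image_url", "image_url": {"url": "data:image/png;base64,..."}},
        {"type": "text", "text": "Describe what is in this image."},
    ]
    converted = [_convert_message_content(item) for item in items]
    # converted == [[{...}], [{...}]], so LiteLLM's item.get("type") fails
    # because each element is a list, not a dict.

    # The change suggested above: also return dicts as-is.
    def _convert_message_content_fixed(content):
        if isinstance(content, (str, list, dict)):
            return content
        return [content]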

The issue was solved in PR 1150.

@jhpiedrahitao (Author) commented

cc @snova-luiss

@ashwinb (Contributor) commented Mar 18, 2025

> 2- chat_completion:structured_output: SambaNova does not yet support structured output with json_schema

Does it support structured outputs with other schemas? When do you expect this feature to be available?
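For context, a structured-output request in the OpenAI-compatible json_schema style looks roughly like this (a sketch; the schema and names are illustrative):

    # Illustrative OpenAI-compatible request body using a json_schema
    # response format (the shape the failing test exercises).
    request = {
        "model": "sambanova/Meta-Llama-3.3-70B-Instruct",
        "messages": [{"role": "user", "content": "Name a city and its country."}],
        "response_format": {
            "type": "json_schema",
            "json_schema": {
                "name": "city_info",
                "schema": {
                    "type": "object",
                    "properties": {
                        "city": {"type": "string"},
                        "country": {"type": "string"},
                    },
                    "required": ["city", "country"],
                },
            },
        },
    }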

@ashwinb (Contributor) commented Mar 18, 2025

> 1- the SambaNova API does not support sending a direct URL; it is required to always send the image in b64

in that case, we should download the image in the sambanova adapter, encode it, and send it as base64 downstream. we do this in a couple of places (I believe maybe in the ollama adapter). we don't want the clients to be aware of this restriction unnecessarily when the Stack can hide it transparently.
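A minimal sketch of that approach, assuming httpx (the helper name is hypothetical, not llama-stack's actual utility):

    import base64

    import httpx

    async def localize_image_url(url: str) -> str:
        # Download the image and re-emit it as a base64 data URL so the
        # downstream SambaNova API never sees a plain http(s) URL.
        async with httpx.AsyncClient() as client:
            response = await client.get(url)
            response.raise_for_status()
        content_type = response.headers.get("content-type", "image/png")
        encoded = base64.b64encode(response.content).decode("utf-8")
        return f"data:{content_type};base64,{encoded}"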

@jhpiedrahitao (Author) commented

Hi @ashwinb, could you please take a look?

@jhpiedrahitao (Author) commented Apr 8, 2025

@ehhuang this PR was already modifying sambanova to use LiteLLM, plus some extra modifications to add support for things like api_key and base-url settings, support for URL images, etc. This was done because the current version was broken for several functionalities like streaming or tool calling. Should I move these changes into the new folder you have created in remote providers (openai-compat), or can this be merged in parallel, given that the template is still pointing to the previous folder?

@ashwinb (Contributor) commented Apr 12, 2025

@jhpiedrahitao Hey sorry about keeping this so stale. Could you resolve conflicts one last time? I will merge it after that.

@jhpiedrahitao (Author) commented Apr 12, 2025

> @jhpiedrahitao Hey sorry about keeping this so stale. Could you resolve conflicts one last time? I will merge it after that.

Hi @ashwinb thanks, no problem, conflicts solved 👍🏻

@jhpiedrahitao (Author) commented

Hi @ashwinb, were you able to take a look at this one?

@jhpiedrahitao (Author) commented

Hi @ashwinb, tagging you again to check if this can be merged.

@ashwinb merged commit b2b00a2 into llamastack:main May 6, 2025
2 checks passed
franciscojavierarceo pushed a commit to franciscojavierarceo/llama-stack that referenced this pull request May 9, 2025
…astack#1596)

@jhpiedrahitao deleted the feat/litellm_sambanova_usage branch May 20, 2025 19:48