feat(llm): add LLM profiles #1843

all-hands-bot · 2026-02-02T23:56:13Z

🔴 Critical: This documentation is misleading and dangerous. The current default behavior (include_secrets=True) could lead users to accidentally commit API keys.

Suggested change

-    Set ``LLM_PROFILE_NAME`` to choose which profile file to load.
-
-    Notes on credentials:
-
-    - New profiles include API keys by default when saved
-    - To omit secrets on disk, pass include_secrets=False to LLMRegistry.save_profile
+    Set ``LLM_PROFILE_NAME`` to choose which profile file to load.
+
+    Security Best Practice:
+
+    - Profiles should be saved WITHOUT secrets (include_secrets=False)
+    - Provide API keys via environment variables (LLM_API_KEY, AWS_ACCESS_KEY_ID, etc.)
+    - Never commit profile files containing secrets to version control
+    - Add *.json to .gitignore if storing profiles in your project directory

Same as below

all-hands-bot · 2026-02-02T23:56:13Z

🟠 Important: The create() method has become quite complex with the profile reference logic. Consider extracting the resume logic into a separate _resume_from_persistence() method to improve readability.

The multiple payload mutations (expanding profile refs, injecting runtime agent, converting back to profile refs) make this hard to follow and maintain.

@@ -29,7 +29,12 @@ def find_documented_examples(docs_path: Path) -> set[str]:
         """
         documented_examples: set[str] = set()
-        # Pattern to match example file references with arbitrary nesting depth.
+        # Pattern to match example file references.
+        #
+        # The agent-sdk examples tree includes nested modules (e.g.
+        # examples/02_remote_agent_server/05_custom_tool/custom_tools/log_data.py),
+        # so we intentionally support *arbitrary* nesting depth under examples/.
+        #
         # Matches: examples/<dir>/.../<file>.py
         pattern = r"examples/(?:[-\w]+/)+[-\w]+\.py"
@@ Expand Down Expand Up @@
                     if relative_path_str.startswith("examples/03_github_workflows/"):
                         continue
-                    # Skip LLM-specific tools examples: these are intentionally not
-                    # enforced by the docs check. See discussion in PR #1486.
+                    # Skip LLM-specific tools examples: these depend on external
+                    # model/provider availability and are intentionally excluded from
+                    # docs example enforcement.
                     if relative_path_str.startswith("examples/04_llm_specific_tools/"):
                         continue
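A quick way to check the nesting behaviour described in the new comment is to run the pattern against a few paths. The pattern is copied from the diff; the sample text is made up for illustration, and the first path in it is a hypothetical example file.

```python
import re

# Pattern copied verbatim from the diff above.
pattern = r"examples/(?:[-\w]+/)+[-\w]+\.py"

# Illustrative documentation text (not taken from the real docs).
text = """
See examples/01_standalone_sdk/01_hello_world.py and
examples/02_remote_agent_server/05_custom_tool/custom_tools/log_data.py.
A file directly under examples/ such as examples/top_level.py is not matched.
"""

# The group is non-capturing, so findall returns the full matched paths.
matches = set(re.findall(pattern, text))
```

Because the directory group uses `+`, at least one directory level under `examples/` is required, so a file sitting directly in `examples/` is skipped while any deeper nesting is accepted.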

@@ -167,6 +167,9 @@ mkdir -p .pr
     - Avoid getattr/hasattr guards and instead enforce type correctness by relying on explicit type assertions and proper object usage, ensuring functions only receive the expected Pydantic models or typed inputs. Prefer type hints and validated models over runtime shape checks.
     - Prefer accessing typed attributes directly. If necessary, convert inputs up front into a canonical shape; avoid purely hypothetical fallbacks.
     - Use real newlines in commit messages; do not write literal "\n".
+    ## Example Scripts
+    - Example scripts in `examples/` should run code directly at module level without wrapping in `if __name__ == "__main__":` guards. This saves a level of indentation and keeps examples concise.
     </CODE>
     <TESTING>

@@ -0,0 +1,7 @@
+    {
+      "model": "litellm_proxy/openai/gpt-5-mini",
+      "base_url": "https://llm-proxy.eval.all-hands.dev",
+      "temperature": 0.2,
+      "max_output_tokens": 4096,
+      "usage_id": "agent"
+    }
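As a rough sketch of how a profile file like this one could be selected through `LLM_PROFILE_NAME`, the snippet below resolves the variable to a JSON file and loads it. The `<profile_dir>/<name>.json` layout, the `"default"` fallback, and both helper functions are assumptions made for illustration; the PR does not show where profiles are stored or how the SDK resolves them.

```python
import json
import tempfile
from collections.abc import Mapping
from pathlib import Path

# Profile contents copied from the diff above.
PROFILE = {
    "model": "litellm_proxy/openai/gpt-5-mini",
    "base_url": "https://llm-proxy.eval.all-hands.dev",
    "temperature": 0.2,
    "max_output_tokens": 4096,
    "usage_id": "agent",
}


def resolve_profile_path(
    profile_dir: Path, env: Mapping[str, str], default: str = "default"
) -> Path:
    """Map LLM_PROFILE_NAME to <profile_dir>/<name>.json (hypothetical layout)."""
    name = env.get("LLM_PROFILE_NAME", default)
    return profile_dir / f"{name}.json"


def load_profile(profile_dir: Path, env: Mapping[str, str]) -> dict:
    """Load the profile selected by the environment mapping."""
    return json.loads(resolve_profile_path(profile_dir, env).read_text())


with tempfile.TemporaryDirectory() as tmp:
    profile_dir = Path(tmp)
    (profile_dir / "agent.json").write_text(json.dumps(PROFILE))
    loaded = load_profile(profile_dir, {"LLM_PROFILE_NAME": "agent"})
```

Note that the profile in the diff carries no credential fields, so a file of this shape can be stored without exposing a key.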

@@ -420,6 +420,7 @@ def model_dump_succint(self, **kwargs):
             """Like model_dump, but excludes None fields by default."""
             if "exclude_none" not in kwargs:
                 kwargs["exclude_none"] = True
             dumped = super().model_dump(**kwargs)
             # remove tool schema details for brevity
             if "tools" in dumped and isinstance(dumped["tools"], dict):

@@ -133,6 +133,12 @@ def __init__(
                        decrypted when loading. If not provided, secrets are redacted
                        (lost) on serialization.
             """
+            # Initialize the registry early so profile references resolve during resume.
+            # The registry must exist before ConversationState.create() attempts to load
+            # persisted state that may contain profile_ref payloads.
+            self.llm_registry = LLMRegistry()
             super().__init__()  # Initialize with span tracking
             # Mark cleanup as initiated as early as possible to avoid races or partially
             # initialized instances during interpreter shutdown.
@@ -169,6 +175,7 @@ def __init__(
                 else None,
                 max_iterations=max_iteration_per_run,
                 stuck_detection=stuck_detection,
+                llm_registry=self.llm_registry,
                 cipher=cipher,
             )
@@ -234,7 +241,6 @@ def _default_callback(e):
             # Agent initialization is deferred to _ensure_agent_ready() for lazy loading
             # This ensures plugins are loaded before agent initialization
-            self.llm_registry = LLMRegistry()
             # Initialize secrets if provided
             if secrets:


@@ -3,7 +3,7 @@
     from collections.abc import Sequence
     from enum import Enum
     from pathlib import Path
-    from typing import Any, Self
+    from typing import TYPE_CHECKING, Any, Self
     from pydantic import Field, PrivateAttr, model_validator
@@ -18,6 +18,12 @@
     from openhands.sdk.event.base import Event
     from openhands.sdk.io import FileStore, InMemoryFileStore, LocalFileStore
     from openhands.sdk.logger import get_logger
+    if TYPE_CHECKING:
+        from openhands.sdk.llm.llm_registry import LLMRegistry
     from openhands.sdk.security.analyzer import SecurityAnalyzerBase
     from openhands.sdk.security.confirmation_policy import (
         ConfirmationPolicyBase,
@@ -181,8 +187,15 @@ def _save_base_state(self, fs: FileStore) -> None:
                     "redacted and lost on restore. Consider providing a cipher to "
                     "preserve secrets."
                 )
-            payload = self.model_dump_json(exclude_none=True, context=context)
-            fs.write(BASE_STATE, payload)
+            payload = self.model_dump(
+                mode="json",
+                exclude_none=True,
+                context={**(context or {}), "persist_profile_ref": True},
+            )
+            if self.agent.llm.profile_id and self.agent.llm.profile_ref:
+                payload["agent"]["llm"] = self.agent.llm.to_profile_ref()
+            fs.write(BASE_STATE, json.dumps(payload))
         # ===== Factory: open-or-create (no load/save methods needed) =====
         @classmethod
@@ -194,6 +207,7 @@ def create(
             persistence_dir: str | None = None,
             max_iterations: int = 500,
             stuck_detection: bool = True,
+            llm_registry: "LLMRegistry | None" = None,
             cipher: Cipher | None = None,
         ) -> "ConversationState":
             """Create a new conversation state or resume from persistence.
@@ -211,13 +225,19 @@ def create(
             history), but all other configuration can be freely changed: LLM,
             agent_context, condenser, system prompts, etc.
+            When conversation state is persisted with LLM profile references (instead
+            of inlined credentials), pass an ``llm_registry`` so profile IDs can be
+            expanded during restore.
             Args:
                 id: Unique conversation identifier
                 agent: The Agent to use (tools must match persisted on restore)
                 workspace: Working directory for agent operations
                 persistence_dir: Directory for persisting state and events
                 max_iterations: Maximum iterations per run
                 stuck_detection: Whether to enable stuck detection
+                llm_registry: Optional registry used to expand profile references when
+                    conversations persist profile IDs instead of inline credentials.
                 cipher: Optional cipher for encrypting/decrypting secrets in
                         persisted state. If provided, secrets are encrypted when
                         saving and decrypted when loading. If not provided, secrets
@@ -241,35 +261,69 @@ def create(
             except FileNotFoundError:
                 base_text = None
+            context: dict[str, object] = {}
+            registry = llm_registry
+            if registry is None:
+                from openhands.sdk.llm.llm_registry import LLMRegistry
+                registry = LLMRegistry()
+            context["llm_registry"] = registry
+            # Ensure we have a registry available during both dump and validate.
+            #
+            # We do NOT implicitly write profile files here. Instead, persistence will
+            # store a profile reference only when the runtime LLM already has an
+            # explicit ``profile_id``.
             # ---- Resume path ----
             if base_text:
-                # Use cipher context for decrypting secrets if provided
-                context = {"cipher": cipher} if cipher else None
-                state = cls.model_validate(json.loads(base_text), context=context)
+                base_payload = json.loads(base_text)
+                # Add cipher context for decrypting secrets if provided
+                if cipher:
+                    context["cipher"] = cipher
-                # Restore the conversation with the same id
-                if state.id != id:
+                persisted_id = ConversationID(base_payload.get("id"))
+                if persisted_id != id:
                     raise ValueError(
                         f"Conversation ID mismatch: provided {id}, "
-                        f"but persisted state has {state.id}"
+                        f"but persisted state has {persisted_id}"
                     )
+                persisted_agent_payload = base_payload.get("agent")
+                if persisted_agent_payload is None:
+                    raise ValueError("Persisted conversation is missing agent state")
                 # Attach event log early so we can read history for tool verification
+                event_log = EventLog(file_store, dir_path=EVENTS_DIR)
+                persisted_agent = AgentBase.model_validate(
+                    persisted_agent_payload,
+                    context={"llm_registry": registry},
+                )
+                agent.verify(persisted_agent, events=event_log)
+                # Use runtime-provided Agent directly (PR #1542 / issue #1451)
+                #
+                # Persist LLMs as profile references only when an explicit profile_id is
+                # set on the runtime LLM.
+                agent_payload = agent.model_dump(
+                    mode="json",
+                    exclude_none=True,
+                    context={"expose_secrets": True, "persist_profile_ref": True},
+                )
+                if agent.llm.profile_id and agent.llm.profile_ref:
+                    agent_payload["llm"] = agent.llm.to_profile_ref()
+                base_payload["agent"] = agent_payload
+                base_payload["workspace"] = workspace.model_dump(mode="json")
+                base_payload["max_iterations"] = max_iterations
+                state = cls.model_validate(base_payload, context=context)
                 state._fs = file_store
-                state._events = EventLog(file_store, dir_path=EVENTS_DIR)
+                state._events = event_log
                 state._cipher = cipher
-                # Verify compatibility (agent class + tools)
-                agent.verify(state.agent, events=state._events)
-                # Commit runtime-provided values (may autosave)
                 state._autosave_enabled = True
-                state.agent = agent
-                state.workspace = workspace
-                state.max_iterations = max_iterations
-                # Note: stats are already deserialized from base_state.json above.
-                # Do NOT reset stats here - this would lose accumulated metrics.
                 logger.info(
                     f"Resumed conversation {state.id} from persistent storage.\n"
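The persist-then-expand round trip in this diff can be sketched with plain dicts. Everything here is a simplified assumption: `ProfileRegistry` stands in for `LLMRegistry`, a profile reference is modelled as a dict containing only `profile_id`, and `dump_state` / `load_state` are hypothetical helpers. The real `to_profile_ref()` payload shape is not shown in the PR.

```python
import copy
import json


class ProfileRegistry:
    """Hypothetical stand-in for LLMRegistry: maps profile IDs to LLM configs."""

    def __init__(self) -> None:
        self._profiles: dict[str, dict] = {}

    def register(self, profile_id: str, config: dict) -> None:
        self._profiles[profile_id] = dict(config)

    def get(self, profile_id: str) -> dict:
        # Raises KeyError for unknown profiles rather than guessing a config.
        return dict(self._profiles[profile_id])


def dump_state(state: dict) -> str:
    """Serialize state, replacing the inline LLM with a reference when it has a profile_id."""
    payload = copy.deepcopy(state)  # never mutate the caller's state
    llm = payload["agent"]["llm"]
    if llm.get("profile_id"):
        payload["agent"]["llm"] = {"profile_id": llm["profile_id"]}
    return json.dumps(payload)


def load_state(text: str, registry: ProfileRegistry) -> dict:
    """Deserialize state, expanding a profile reference through the registry."""
    payload = json.loads(text)
    llm = payload["agent"]["llm"]
    if set(llm) == {"profile_id"}:
        expanded = registry.get(llm["profile_id"])
        expanded["profile_id"] = llm["profile_id"]
        payload["agent"]["llm"] = expanded
    return payload


registry = ProfileRegistry()
registry.register("agent", {"model": "m", "temperature": 0.2})

state = {
    "id": "c1",
    "agent": {
        "llm": {"profile_id": "agent", "model": "m", "temperature": 0.2, "api_key": "secret"}
    },
}
text = dump_state(state)
restored = load_state(text, registry)
```

In this sketch the serialized text never contains the inline credentials, and an LLM without a `profile_id` is written inline unchanged, which mirrors the "only when an explicit profile_id is set" rule in the diff's comments.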

