[feat] support o3-mini #6570

Merged
merged 7 commits into main from xw/o3mini on Feb 3, 2025

Conversation

@xingyaoww (Collaborator) commented Jan 31, 2025

End-user friendly description of the problem this fixes or functionality that this introduces

  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

Give a summary of what the PR does, explaining any non-trivial design decisions


Link of any specific issues this addresses


To run this PR locally, use the following command:

```shell
docker run -it --rm \
  -p 3000:3000 \
  -v /var/run/docker.sock:/var/run/docker.sock \
  --add-host host.docker.internal:host-gateway \
  -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:66d0fbb-nikolaik \
  --name openhands-app-66d0fbb \
  docker.all-hands.dev/all-hands-ai/openhands:66d0fbb
```

```diff
@@ -61,7 +61,8 @@ RUN if [ -z "${RELEASE_TAG}" ]; then \
     fi && \
     wget https://github.com/${RELEASE_ORG}/openvscode-server/releases/download/${RELEASE_TAG}/${RELEASE_TAG}-linux-${arch}.tar.gz && \
     tar -xzf ${RELEASE_TAG}-linux-${arch}.tar.gz && \
-    mv -f ${RELEASE_TAG}-linux-${arch} ${OPENVSCODE_SERVER_ROOT} && \
+    if [ -d "${OPENVSCODE_SERVER_ROOT}" ]; then rm -rf "${OPENVSCODE_SERVER_ROOT}"; fi && \
+    mv ${RELEASE_TAG}-linux-${arch} ${OPENVSCODE_SERVER_ROOT} && \
```
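The replaced `mv -f` would not overwrite an existing destination directory (`mv` moves the source *inside* it instead, nesting the new release), so the fix removes the old directory before moving. The same replace-then-move pattern is sketched below in Python purely for illustration; all paths are throwaway placeholders, not the image's real paths:

```python
import pathlib
import shutil
import tempfile

def replace_dir(src: pathlib.Path, dest: pathlib.Path) -> None:
    """Move src to dest, replacing any previous install.

    A bare move would nest src inside an existing dest directory,
    which is the bug the Dockerfile change fixes with `rm -rf`.
    """
    if dest.is_dir():
        shutil.rmtree(dest)  # mirrors the `rm -rf` added in the diff
    shutil.move(str(src), str(dest))

# Tiny demo with throwaway directories (placeholder paths):
tmp = pathlib.Path(tempfile.mkdtemp())
src = tmp / "release"
dest = tmp / "server"
src.mkdir()
dest.mkdir()  # simulate a leftover install from a previous build
replace_dir(src, dest)
```

After the call, `dest` is the freshly moved release directory rather than containing a nested `release/` subdirectory.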
Contributor:
Is this o3-mini related?

@xingyaoww (Collaborator, Author):
It is not, but it fixes an issue in main when running evals. We could keep them together in this PR, or I can move it to a separate PR.

Contributor:
Cool, I’m just a passerby who recently tried to add o3-mini support to this project. While working on it, I came across this PR and thought I’d check it out. Keeping a clean Git history is usually helpful for troubleshooting and makes it easier for others to jump in, so I just wanted to share that thought. Of course, it’s totally up to you as the Collaborator. Thanks for reading!

@regismesquita (Contributor) commented Feb 1, 2025

o3-mini seems to be missing from frontend/src/utils/verified-models.ts; not sure if you want to add it only after testing, or if this was missed.

@xingyaoww (Collaborator, Author) commented Feb 1, 2025

> o3-mini seems to be missing from frontend/src/utils/verified-models.ts; not sure if you want to add it only after testing, or if this was missed.

Yep! We will add it to the verified model list (or even as one of the default models) once the eval finishes and the results turn out favorably.

@xingyaoww xingyaoww marked this pull request as ready for review February 2, 2025 22:32
@xingyaoww xingyaoww requested a review from enyst February 2, 2025 22:32
@xingyaoww (Collaborator, Author) commented:

Okay, official result for o3-mini after 4 runs. The total cost for the 4 runs is about $1,257.39, i.e. roughly $314.35 per run. Conclusion: it costs slightly less than Sonnet and performs slightly weaker than Claude.

[image]

https://openhands-ai.slack.com/archives/C06PB3T5ZK6/p1738536767441849?thread_ts=1738363049.253389&cid=C06PB3T5ZK6

```diff
@@ -148,17 +165,11 @@ def __init__(
             base_url=self.config.base_url,
             api_version=self.config.api_version,
             custom_llm_provider=self.config.custom_llm_provider,
-            max_tokens=self.config.max_output_tokens,
+            max_completion_tokens=self.config.max_output_tokens,
```
Collaborator:
This is necessary for o3-mini, but I wonder, does it work with the others like before? litellm translates max_tokens, but it might be safer to add max_completion_tokens only for reasoning models.

@enyst (Collaborator) left a review comment:

Thank you for all the evals, this is exciting! Interesting times ahead.

Not sure, but I am a bit concerned that max_completion_tokens is newer and intended for reasoning models. It might be safer to set it only for them for now? I didn't find anything clear on this in litellm.
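One way to act on that concern, sketched here under assumptions (the helper name and the reasoning-model set are hypothetical, not OpenHands code): choose the token-limit parameter name per model family, passing max_completion_tokens only to reasoning models.

```python
# Hypothetical helper; the reasoning-model set below is an assumption
# for illustration, not the project's actual list.
REASONING_MODELS = {"o1", "o1-mini", "o3-mini"}

def token_limit_kwargs(model: str, max_output_tokens: int) -> dict:
    """Return the token-limit argument under the name each model family expects."""
    if model.split("/")[-1] in REASONING_MODELS:
        # Newer OpenAI reasoning models take max_completion_tokens.
        return {"max_completion_tokens": max_output_tokens}
    # Older chat models (and most providers via litellm) take max_tokens.
    return {"max_tokens": max_output_tokens}
```

The returned dict could then be spread into the completion call, keeping existing providers on the parameter they have always received.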

@xingyaoww (Collaborator, Author):

@enyst

[image]

I looked through the LiteLLM repo, and it seems LiteLLM does translate them all: https://github.com/search?q=repo%3ABerriAI%2Flitellm+max_completion_tokens&type=code&p=2

For Anthropic it didn't make any difference at all: both max_tokens and max_completion_tokens get translated to max_tokens_to_sample.

[image]

Maybe let's merge it and use it for a while to see if any issues come up?

@xingyaoww xingyaoww changed the title Support o3-mini [feat] support o3-mini Feb 3, 2025
@enyst (Collaborator) commented Feb 3, 2025

OK let's merge then!

@xingyaoww (Collaborator, Author):

(For a sanity check, I built from this branch and tried running Claude; there's no issue. Probably safe to merge now.)

@xingyaoww xingyaoww merged commit 622fc52 into main Feb 3, 2025
18 checks passed
@xingyaoww xingyaoww deleted the xw/o3mini branch February 3, 2025 15:26
zchn pushed a commit to zchn/OpenHands that referenced this pull request Feb 4, 2025
adityasoni9998 pushed a commit to adityasoni9998/OpenHands that referenced this pull request Feb 7, 2025