feat(sandbox): Add Jupyter Kernel for Interactive Python Interpreter for Sandbox #1215

xingyaoww · 2024-04-18T17:40:33Z

This PR adds Python code execution capability from CodeAct based on IPython to augment the bash command line interface for current agents.

Eventual expected outcome (see video below): Agents can automatically solve user's requests by writing and executing Python code (interactively) and running bash commands (based on our existing sandbox). This interactive Python interpreter supports displaying figures/charts in markdown format via base64, which we can later integrate into the front-end.

codeact-demo.mp4

Demo of the expected capability - work-in-progress.

Right now, it only supports SSHBox, and I have tested and confirmed it is working in ubuntu with both RUN_AS_DEVIN='True' and RUN_AS_DEVIN='False'.

You can test it locally by first building the image via ./containers/build.sh sandbox (comment out the build architecture line if error), then run python3 opendevin/sandbox/ssh_box.py. Any command you type that starts with py: will be forwarded to the Python Interpreter for execution.

rbren · 2024-04-18T19:18:46Z

opendevin/sandbox/ssh_box.py

@@ -206,6 +211,28 @@ def execute(self, cmd: str) -> Tuple[int, str]:
        exit_code = int(exit_code.lstrip('echo $?').strip())
        return exit_code, command_output

+    def setup_jupyter(self):


Instead of putting this into SSHBox, could we have a JupyterBox that extends SSHBox?

I am also debating about this: Ideally, the Python execution capability should be parallel to bash execution so that every agent can resort to one of them when needed. In that case, I think it probably makes sense for us to extend the existing SSHBox to have a separate Python execution capability via execute_python as it won't interfere with the existing actions going through the SSHBox and other existing agents can also have python execution natively supported easily without specifying a different SSHJupyterBox?

Or we can just do the sub-class and set the SSHJupyterBox to the default just to make the code look better :) - anyway, I can create a sub-class first, and then we can decide whether we want this Python execution capability to be a default behavior.

rbren · 2024-04-18T19:19:51Z

opendevin/sandbox/ssh_box.py

@@ -371,6 +402,12 @@ def close(self):
                ssh_box.kill_background(bg_cmd.id)
                logger.info('Background process killed')
                continue
+            if user_input.startswith('py:'):


Maybe we should have a JupyterAgent or something to demonstrate this functionality?

For this PR, I'm just trying to get the sandbox component ready :) I'll submit follow-up PRs to update CodeActAgent using the sandbox with instructions on how to use the fine-tuned open-source models (CodeAgentAgent-Mistral-7b - it can run on a laptop!).

rbren · 2024-04-18T19:20:04Z

This is very cool! The demo is neat

yufansong · 2024-04-19T03:44:16Z

opendevin/sandbox/jupyter_kernel.py

+        self.base_url = f'http://{url_suffix}'
+        self.base_ws_url = f'ws://{url_suffix}'


Suggested change

self.base_url = f'http://{url_suffix}'

self.base_ws_url = f'ws://{url_suffix}'

self.base_url = f'http://{url_suffix}'

self.base_ws_url = f'ws://{url_suffix}'

self.ws = None

My original implement was like this but mypy was not super happy about it: it assume self.ws is None and does not have methods like 'send_messages' etc, hence causing issue for its typing system :(

Fine, let's keep the original code.

yufansong · 2024-04-19T03:45:14Z

opendevin/sandbox/jupyter_kernel.py

+        if not hasattr(self, 'ws') or not self.ws:
+            return


Suggested change

if not hasattr(self, 'ws') or not self.ws:

return

if not self.ws:

return

yufansong · 2024-04-19T03:46:41Z

opendevin/sandbox/jupyter_kernel.py

+                )
+
+    async def _connect(self):
+        if hasattr(self, 'ws') and self.ws:


Suggested change

if hasattr(self, 'ws') and self.ws:

if self.ws:

li-boxuan · 2024-04-19T04:05:52Z

containers/sandbox/Dockerfile

+RUN pip install jupyterlab notebook jupyter_kernel_gateway
+# Add common data science utils
+RUN pip install transformers[torch]
+RUN pip install torch --index-url https://download.pytorch.org/whl/cpu


Do you expect agent would run torch to solve user's problems? An example would be cool.

Yes! I did try to use CodeActAgent-Mistral 7B and let it write Pytorch code for some simple tasks (e.g., regression), and it did actually solve the problem by 80% - I suspect by switching towards a stronger base model (e.g., 70B), the agent will be able to tackle such problems, so i'd like to keep these as in the container.

alternatively in the future if we are concerned about the size of the container, we can make these optional and/or allow user to use their customized executor image.

I don't really have a concrete qualitative example at hand using PyTorch, the closest example I have is having the agent/LLM using SKLearn for machine learning tasks: https://twitter.com/xingyaow_/status/1754556862949994917

that's cool!

yufansong

LGTM

xingyaoww · 2024-04-19T08:55:40Z

@rbren, I will merge this so that I can get the new sandbox image built for further development -- but feel free to revert if you think something is wrong! ;)

…for Sandbox (All-Hands-AI#1215) * add initial version of py interpreter * fix bug * fix async issue * remove debugging print statement * initialize kernel & update printing * fix port mapping * uncomment debug lines * fix poetry lock * make jupyter py interpreter into a subclass

…rpreter for Sandbox (All-Hands-AI#1215)" This reverts commit 492feec.

…rpreter for Sandbox (#1215)" (#1229) This reverts commit 492feec.

xingyaoww added 7 commits April 18, 2024 12:27

add initial version of py interpreter

45fe5eb

fix bug

4792a13

fix async issue

253415d

remove debugging print statement

733664b

initialize kernel & update printing

22c0e36

fix port mapping

a154790

uncomment debug lines

9626a74

xingyaoww requested a review from rbren April 18, 2024 17:40

xingyaoww added 2 commits April 18, 2024 13:18

fix poetry lock

dbdead8

Merge branch 'main' into add-jupyter-sandbox

aa0263e

rbren reviewed Apr 18, 2024

View reviewed changes

xingyaoww added 2 commits April 18, 2024 21:05

make jupyter py interpreter into a subclass

437882d

Merge branch 'main' into add-jupyter-sandbox

68866db

yufansong reviewed Apr 19, 2024

View reviewed changes

li-boxuan reviewed Apr 19, 2024

View reviewed changes

yufansong approved these changes Apr 19, 2024

View reviewed changes

xingyaoww changed the title ~~(feat) Add Jupyter Kernel for Interactive Python Interpreter for Sandbox~~ feat(sandbox): Add Jupyter Kernel for Interactive Python Interpreter for Sandbox Apr 19, 2024

xingyaoww merged commit 492feec into All-Hands-AI:main Apr 19, 2024

xingyaoww mentioned this pull request Apr 19, 2024

fix(sandbox): Extend the sandbox execute method to support python instead of using additional method #1228

Closed

xingyaoww added a commit to xingyaoww/OpenHands that referenced this pull request Apr 19, 2024

Revert "feat(sandbox): Add Jupyter Kernel for Interactive Python Inte…

89d8289

…rpreter for Sandbox (All-Hands-AI#1215)" This reverts commit 492feec.

xingyaoww mentioned this pull request Apr 19, 2024

Revert "feat(sandbox): Add Jupyter Kernel for Interactive Python Inte… #1229

Merged

rbren pushed a commit that referenced this pull request Apr 19, 2024

Revert "feat(sandbox): Add Jupyter Kernel for Interactive Python Inte…

871eefe

…rpreter for Sandbox (#1215)" (#1229) This reverts commit 492feec.

xingyaoww mentioned this pull request Apr 20, 2024

feat(sandbox): Candidate Implementation of Sandbox Plugin to Support Jupyter #1255

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sandbox): Add Jupyter Kernel for Interactive Python Interpreter for Sandbox #1215

feat(sandbox): Add Jupyter Kernel for Interactive Python Interpreter for Sandbox #1215

xingyaoww commented Apr 18, 2024

rbren Apr 18, 2024

xingyaoww Apr 19, 2024

rbren Apr 18, 2024

xingyaoww Apr 19, 2024 •

edited

Loading

rbren commented Apr 18, 2024

yufansong Apr 19, 2024

xingyaoww Apr 19, 2024

yufansong Apr 19, 2024 •

edited

Loading

yufansong Apr 19, 2024

yufansong Apr 19, 2024

li-boxuan Apr 19, 2024 •

edited

Loading

xingyaoww Apr 19, 2024

xingyaoww Apr 19, 2024

li-boxuan Apr 19, 2024

yufansong left a comment

xingyaoww commented Apr 19, 2024

		self.base_url = f'http://{url_suffix}'
		self.base_ws_url = f'ws://{url_suffix}'

feat(sandbox): Add Jupyter Kernel for Interactive Python Interpreter for Sandbox #1215

feat(sandbox): Add Jupyter Kernel for Interactive Python Interpreter for Sandbox #1215

Conversation

xingyaoww commented Apr 18, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

xingyaoww Apr 19, 2024 • edited Loading

Choose a reason for hiding this comment

rbren commented Apr 18, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yufansong Apr 19, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

li-boxuan Apr 19, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yufansong left a comment

Choose a reason for hiding this comment

xingyaoww commented Apr 19, 2024

xingyaoww Apr 19, 2024 •

edited

Loading

yufansong Apr 19, 2024 •

edited

Loading

li-boxuan Apr 19, 2024 •

edited

Loading