Agent improvement #1032

qingyun-wu · 2023-05-10T02:41:45Z

Comments from @gagb:
Some observations:

The agent often starts suggesting shell commands, which makes the code fail, especially as the conversation gets longer.
Sometimes the user responds with an empty string, the code agent never reaches a terminal state, and the code gets stuck in a loop. This also happens when lang=unknown, e.g. because the agent didn't wrap the Python code in code blocks.
The code fails if the context size exceeds 8k.
Original comment: 3b3dd60#diff-9ac9829642f8aa5ad3ed717f7f60eabedf33210195465c1f6473cd2cfd4cd2af
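
For the context-size failure, one possible mitigation is to trim the conversation history before each request. The following is only a minimal sketch, not what the PR implements: `estimate_tokens` and `trim_history` are hypothetical helpers, the 4-characters-per-token estimate is a crude assumption standing in for a real tokenizer, and the first message is assumed to be the system/goal message that should always be kept.

```python
from typing import Dict, List

Message = Dict[str, str]


def estimate_tokens(text: str) -> int:
    """Crude stand-in for a real tokenizer (assumption: ~4 characters per token)."""
    return len(text) // 4 + 1


def trim_history(messages: List[Message], max_tokens: int = 8000) -> List[Message]:
    """Keep the first message plus the most recent messages that fit the budget."""
    if not messages:
        return []
    head, tail = messages[0], messages[1:]
    budget = max_tokens - estimate_tokens(head["content"])
    kept: List[Message] = []
    # Walk backwards so the newest turns are considered first.
    for msg in reversed(tail):
        cost = estimate_tokens(msg["content"])
        if cost > budget:
            break  # everything older than this is dropped as well
        kept.append(msg)
        budget -= cost
    # Restore chronological order after the backwards walk.
    return [head] + kept[::-1]
```

Dropping the oldest turns while keeping the first message is a design choice: it keeps the original goal in view, at the cost of losing intermediate results from early in the conversation.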

PR #1025

Tasks

handle context size overflow in AssistantAgent autogen#9

0 of 1

enhancement

qingyun-wu · 2023-05-12T01:38:09Z

@gagb The second problem should have been addressed in the latest PR. Let me know if you still have this observation.

gagb · 2023-05-25T17:25:47Z

More feedback based on integration with tinyRA and using gpt-3.5-turbo:

Drift: The conversation may drift and start to execute code that is unrelated to the goal and possibly very unsafe. We need more safety checks on the code it suggests.
Memory refreshing: Others have found that occasionally refreshing the agent's memory with the goal can help.
Guaranteed structured output: Currently there are no guarantees that the coding agent will output a Python code block (or even use code blocks). This can cause the conversation to fail.
Shell agent: Currently the agent can't execute the shell commands it needs to succeed (e.g., pip commands to install Python packages).
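
To make the structured-output and shell points concrete, here is a rough sketch of how a reply could be classified before anything is executed. It is an illustration only: `extract_code`, `next_action`, the action names, and the set of shell language tags are all hypothetical and not part of the current agent code. It also does not cover the safety checks mentioned under Drift; a real shell path would need those before running anything.

```python
import re
from typing import List, Tuple

# Matches fenced blocks: three backticks, optional language tag, newline, body.
CODE_BLOCK = re.compile(r"`{3}([A-Za-z0-9_+-]*)[ \t]*\n(.*?)`{3}", re.DOTALL)
SHELL_LANGS = {"sh", "bash", "shell"}  # assumption: tags treated as shell


def extract_code(reply: str) -> List[Tuple[str, str]]:
    """Return (lang, code) pairs; fall back to ("unknown", reply) if no fence is found."""
    blocks = [(lang.lower() or "unknown", code) for lang, code in CODE_BLOCK.findall(reply)]
    return blocks or [("unknown", reply)]


def next_action(reply: str) -> str:
    """Decide what to do with a reply, based on its first code block."""
    if not reply.strip():
        return "stop"  # an empty reply ends the loop instead of being retried
    lang, _ = extract_code(reply)[0]
    if lang in SHELL_LANGS:
        return "run_shell"
    if lang == "python":
        return "run_python"
    # No usable language tag: ask the agent to resend the code in a tagged block.
    return "ask_reformat"
```

With this shape, an untagged or unfenced reply leads to a request for reformatting rather than a failed execution, and an empty reply ends the conversation rather than looping.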

gagb · 2023-05-25T20:56:17Z

> @gagb The second problem should have been addressed in the latest PR. Let me know if you still have this observation.

I think it still happens with gpt-3.5. I haven't been able to test with gpt-4 because I don't have access to it. I am working on a feature to easily share failure cases from tinyRA.

sonichi added the enhancement New feature or request label May 12, 2023

sonichi added this to the Upgrade of autogen milestone Jun 23, 2023
