add action input as parameters for tool execution in conversational agent #3200

jngz-es · 2024-11-04T18:31:25Z

Description

Related Issues

#3134

Check List

New functionality includes testing.
New functionality has been documented.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

ml-algorithms/src/main/java/org/opensearch/ml/engine/algorithms/agent/AgentUtils.java

mingshl · 2024-11-14T01:06:52Z

The tests passed; the job failed only in the artifact upload step, which is not related to this code change. Approved.

Run actions/upload-artifact@v4
/usr/bin/docker exec  a6c6ef6ad38d2e9993d03ba2bdc50f2146f892a4a109fc9fecaf2c66802943f0 sh -c "cat /etc/*release | grep ^ID"
/__e/node20/bin/node: /lib64/libm.so.6: version `GLIBC_2.27' not found (required by /__e/node20/bin/node)
/__e/node20/bin/node: /lib64/libc.so.6: version `GLIBC_2.28' not found (required by /__e/node20/bin/node)

ml-algorithms/src/main/java/org/opensearch/ml/engine/algorithms/agent/AgentUtils.java

reuschling · 2024-11-18T17:00:18Z

The changes for AgentUtils look fine, but is AgentUtils used for conversational agents? The tool parameters are built inside MLConversationalFlowAgentRunner.getToolExecuteParams(MLToolSpec toolSpec, Map<String, String> params), i.e. here. There is no AgentUtils invocation. The only invocation of AgentUtils.constructToolParams I found is inside MLChatAgentRunner.

I showed a code proposal for getToolExecuteParams at #2977 (comment). The only difference is that there is no dedicated actionInput parameter there: so far the actionInput is the "input" entry, which has to be stored temporarily in a local variable.

I would also strongly recommend adding the new "action_input" parameter to flow agents (i.e. MLFlowAgentRunner.getToolExecuteParams). AgentUtils is not used there either. Of course, parameters.previous_tool_name.output is available there, but tool specifications should behave the same regardless of where they are used, whether inside flow or conversational agents.
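A minimal sketch of the idea described in this comment, under stated assumptions: the class name, the method signature, and the key name ACTION_INPUT_KEY are hypothetical and only mirror the discussion; this is not the ml-commons implementation. It illustrates why the dynamic "input" entry has to be held in a local variable before static config entries are merged in.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a runner's getToolExecuteParams; not the ml-commons code.
final class FlowToolParamsSketch {
    // Key name taken from the suggestion in this thread; assumed, not confirmed.
    static final String ACTION_INPUT_KEY = "action_input";

    static Map<String, String> getToolExecuteParams(
            Map<String, String> toolSpecParams,   // parameters declared on the tool spec (may be null)
            Map<String, String> toolSpecConfig,   // static config from the tool spec (may be null)
            Map<String, String> requestParams) {  // parameters of the current execution
        Map<String, String> executeParams = new HashMap<>();
        if (toolSpecParams != null) {
            executeParams.putAll(toolSpecParams);
        }
        executeParams.putAll(requestParams);

        // Remember the dynamic "input" before a static config entry can overwrite it.
        String actionInput = executeParams.get("input");

        if (toolSpecConfig != null) {
            executeParams.putAll(toolSpecConfig);
        }

        // Expose the original input under a dedicated key so a static
        // input template can still refer to it.
        if (actionInput != null) {
            executeParams.put(ACTION_INPUT_KEY, actionInput);
        }
        return executeParams;
    }

    public static void main(String[] args) {
        Map<String, String> result = getToolExecuteParams(
            Map.of(),
            Map.of("input", "{\"index\": \"test_population_data\"}"),
            Map.of("input", "population of Seattle"));
        System.out.println(result.get(ACTION_INPUT_KEY)); // prints: population of Seattle
    }
}
```

In this sketch the static config wins for "input", while the value that was there before the merge stays reachable under the dedicated key.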


jngz-es · 2024-12-02T18:47:02Z

The changes for AgentUtils look fine, but is AgentUtils used for conversational agents? The tool parameters are built inside MLConversationalFlowAgentRunner.getToolExecuteParams(MLToolSpec toolSpec, Map<String, String> params), i.e. here. There is no AgentUtils invocation. The only invocation of AgentUtils.constructToolParams I found is inside MLChatAgentRunner.

I showed a code proposal for getToolExecuteParams at #2977 (comment). The only difference is that there is no dedicated actionInput parameter there: so far the actionInput is the "input" entry, which has to be stored temporarily in a local variable.

I would also strongly recommend adding the new "action_input" parameter to flow agents (i.e. MLFlowAgentRunner.getToolExecuteParams). AgentUtils is not used there either. Of course, parameters.previous_tool_name.output is available there, but tool specifications should behave the same regardless of where they are used, whether inside flow or conversational agents.

Hi @reuschling, thanks for the comments. Yeah, you are right. The change is only for conversational agents. As you also mentioned, parameters.previous_tool_name.output is designed as the action input for flow agents, which run a sequence of tools. I don't see a flow agent use case where parameters.previous_tool_name.output could not meet the requirement but the action input could.


reuschling · 2024-12-03T10:48:15Z

Hi @reuschling, thanks for the comments. Yeah, you are right. The change is only for conversational agents.

Sorry, but I think you misunderstood me. Currently this PR makes NO change to conversational agents. It changes AgentUtils, which is ONLY invoked inside MLChatAgentRunner. For conversational agents, you have to modify MLConversationalFlowAgentRunner.

As you also mentioned, parameters.previous_tool_name.output is designed as the action input for flow agents, which run a sequence of tools. I don't see a flow agent use case where parameters.previous_tool_name.output could not meet the requirement but the action input could.

Exactly. But when someone wants to use the same tool inside a conversational agent and inside a flow agent, they have to change the tool definition from parameters.previous_tool_name.output to the action input or vice versa. Why give the same thing different names? This is not about more functionality, but about consistent logic and design.

ylwu-amzn · 2024-12-04T08:48:54Z

@reuschling Thanks for reviewing this PR. From #2918 (comment), I think you know "MLChatAgentRunner uses the AgentUtils method AgentUtils.constructToolParams for generating the params for a tool."

The name is confusing, but MLChatAgentRunner is actually the runner for the conversational agent (code link).
Refer to this tutorial for the differences between agent types.

Test

I have tested this PR with the Bedrock anthropic.claude-instant-v1 model.

First, create a test_population_data index that has a text field population_description. Refer to this tutorial for creating the index.
Then configure the config of SearchIndexTool: add a static input template containing the placeholder llm_generated_action_input. The placeholder will be substituted with the input generated by the LLM.

POST _plugins/_ml/agents/_register
{
    "name": "Test Agent",
    "type": "conversational",
    "description": "Simple agent to test the agent framework",
    "llm": {
        "model_id": "<your LLM model id>",
        "parameters": {
            "max_iteration": 5,
            "stop_when_no_tool_found": true,
            "disable_trace": false
        }
    },
    "memory": {
        "type": "conversation_index"
    },
    "app_type": "chat_with_rag",
    "tools": [
        {
            "type": "SearchIndexTool",
            "description": "A tool to search opensearch index with natural language question. If you don't know answer for some question, you should always try to search data with this tool. Action Input: <natural language question>",
            "include_output_in_agent_response": true,
            "config": {
                "input": "{\"index\": \"test_population_data\", \"query\": {\"query\":{\"match\":{\"population_description\":\"${parameters.llm_generated_action_input}\"}}} }"
            }
        }
    ]
}

Test the agent with:

{
  "parameters": {
    "question": "what's the population increase of Seattle from 2021 to 2023?"
  }
}

Feel free to test. BTW, you can find @jngz-es and me in the public OpenSearch ml Slack channel https://join.slack.com/t/opensearch/shared_invite/zt-2r5scz3ty-5SMPhqJE_Lk2HqC6ex4mWg; you are welcome to join. I'm happy to jump on a call if that makes it easier to explain the details with a demo.
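The placeholder substitution used in the test setup above can be sketched as follows. This is an illustrative, standard-library-only sketch: the class PlaceholderSketch and the method fill are hypothetical names, and the regex-based replacement is an assumption for demonstration, not the mechanism ml-commons uses internally.

```java
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper that fills ${parameters.<name>} placeholders in a template.
final class PlaceholderSketch {
    // Matches ${parameters.some_name} and captures "some_name".
    private static final Pattern PLACEHOLDER =
        Pattern.compile("\\$\\{parameters\\.([A-Za-z0-9_.]+)\\}");

    static String fill(String template, Map<String, String> params) {
        Matcher m = PLACEHOLDER.matcher(template);
        StringBuilder out = new StringBuilder();
        while (m.find()) {
            String value = params.get(m.group(1));
            // Unknown placeholders are left untouched in this sketch.
            String replacement = value != null ? value : m.group();
            // quoteReplacement keeps '$' and '\' in the value literal.
            m.appendReplacement(out, Matcher.quoteReplacement(replacement));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        String template =
            "{\"match\":{\"population_description\":\"${parameters.llm_generated_action_input}\"}}";
        Map<String, String> params =
            Map.of("llm_generated_action_input", "Seattle population 2021 to 2023");
        System.out.println(fill(template, params));
    }
}
```

Note that this sketch inserts the value verbatim. A value containing double quotes or backslashes would produce invalid JSON, so anything beyond a demonstration would need to escape the value first.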

ylwu-amzn · 2024-12-04T09:06:39Z

ml-algorithms/src/main/java/org/opensearch/ml/engine/algorithms/agent/AgentUtils.java

@@ -472,6 +472,11 @@ public static Map<String, String> constructToolParams(
        if (toolSpecConfigMap != null) {
            toolParams.putAll(toolSpecConfigMap);
        }
+        toolParams.put("llm_generated_action_input", actionInput);


It may not be necessary to mention "action" explicitly, considering that the REST API uses the term "tool". Users may confuse tool and action. How about just llm_generated_input?

Also consider using a constant for this string, since it is reused in the tests.
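The two review suggestions above (a shorter key name and a shared constant) could look roughly like this. It is a sketch under assumptions: the constant name LLM_GEN_INPUT, the class name, and the simplified three-argument signature are hypothetical and do not reproduce the real AgentUtils.constructToolParams signature.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified variant of the change shown in the diff above.
final class ToolParamsSketch {
    // One constant shared by production code and tests (name is an assumption).
    static final String LLM_GEN_INPUT = "llm_generated_input";

    static Map<String, String> constructToolParams(
            Map<String, String> baseParams,         // parameters collected so far
            Map<String, String> toolSpecConfigMap,  // static tool config, may be null
            String actionInput) {                   // input generated by the LLM, may be null
        Map<String, String> toolParams = new HashMap<>(baseParams);
        if (toolSpecConfigMap != null) {
            toolParams.putAll(toolSpecConfigMap);
        }
        // HashMap permits null values, so a null action input is stored as null here.
        toolParams.put(LLM_GEN_INPUT, actionInput);
        return toolParams;
    }

    public static void main(String[] args) {
        Map<String, String> params = constructToolParams(
            Map.of("question", "what's the population increase of Seattle?"),
            Map.of("index", "test_population_data"),
            "Seattle population increase");
        System.out.println(params.get(LLM_GEN_INPUT)); // prints: Seattle population increase
    }
}
```

With a constant, tests can refer to ToolParamsSketch.LLM_GEN_INPUT instead of repeating the string literal, so a later rename touches only one place.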

reuschling · 2024-12-04T12:27:45Z

@reuschling Thanks for reviewing this PR. From #2918 (comment), I think you know "MLChatAgentRunner uses the AgentUtils method AgentUtils.constructToolParams for generating the params for a tool."

The name is confusing, but MLChatAgentRunner is actually the runner for the conversational agent (code link).

Thanks a lot, @ylwu-amzn, for the clarification with the code link. Yes, the name confused me; sorry about that, @jngz-es. In this case I'm also fine with the changes :)

jngz-es requested review from b4sjoo, dhrubo-os, model-collapse, rbhavna, ylwu-amzn, zane-neo, Zhangxunmt, austintlee, HenryL27, sam-herman and xinyual as code owners November 4, 2024 18:31

jngz-es had a problem deploying to ml-commons-cicd-env November 4, 2024 18:31 — with GitHub Actions Failure

jngz-es temporarily deployed to ml-commons-cicd-env November 4, 2024 18:31 — with GitHub Actions Inactive

pyek-bot reviewed Nov 4, 2024

ml-algorithms/src/main/java/org/opensearch/ml/engine/algorithms/agent/AgentUtils.java

jngz-es mentioned this pull request Nov 13, 2024

add config field in MLToolSpec for static parameters #2977

Merged

5 tasks

jngz-es had a problem deploying to ml-commons-cicd-env November 13, 2024 21:18 — with GitHub Actions Failure

jngz-es had a problem deploying to ml-commons-cicd-env November 13, 2024 22:29 — with GitHub Actions Failure

mingshl previously approved these changes Nov 14, 2024

ylwu-amzn reviewed Nov 15, 2024

ml-algorithms/src/main/java/org/opensearch/ml/engine/algorithms/agent/AgentUtils.java

add llm generated action input as parameters for tool execution in conversational agent

b1c518a

Signed-off-by: Jing Zhang <jngz@amazon.com>

jngz-es dismissed mingshl’s stale review via b1c518a December 2, 2024 18:20

jngz-es force-pushed the action_input branch from fa931ee to b1c518a Compare December 2, 2024 18:20

jngz-es had a problem deploying to ml-commons-cicd-env December 2, 2024 18:21 — with GitHub Actions Failure

jngz-es had a problem deploying to ml-commons-cicd-env December 2, 2024 18:32 — with GitHub Actions Failure

add UT for null action input

997c50b

Signed-off-by: Jing Zhang <jngz@amazon.com>

jngz-es force-pushed the action_input branch from c493fee to 997c50b Compare December 2, 2024 18:56

jngz-es had a problem deploying to ml-commons-cicd-env December 2, 2024 18:57 — with GitHub Actions Failure

jngz-es temporarily deployed to ml-commons-cicd-env December 2, 2024 18:57 — with GitHub Actions Inactive

jngz-es had a problem deploying to ml-commons-cicd-env December 2, 2024 21:23 — with GitHub Actions Failure

jngz-es temporarily deployed to ml-commons-cicd-env December 2, 2024 22:24 — with GitHub Actions Inactive

jngz-es temporarily deployed to ml-commons-cicd-env December 2, 2024 23:21 — with GitHub Actions Inactive

jngz-es requested review from pyek-bot, ylwu-amzn, dbwiddis and mingshl December 3, 2024 17:06

ylwu-amzn reviewed Dec 4, 2024

pyek-bot approved these changes Dec 5, 2024
