
Conversation

@q-nathangrand (Contributor) commented Oct 10, 2025:

See: #4596

…ol calls in VertexAiGeminiChatModel

Signed-off-by: NathanGrand <nathangrand@quantexa.com>
String text = parts.stream()
.filter(part -> part.hasText() && !part.getText().isEmpty())
.map(Part::getText)
.collect(Collectors.joining(" "));
@q-nathangrand (Author) commented on the snippet above:

It didn't make sense to me to be turning one candidate into multiple generations, but perhaps I'm misunderstanding?

The previous behaviour collected all parts containing function calls and combined them into one AssistantMessage; the idea here was to take the same approach when multiple parts contain text.
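
To make the intent concrete, here is a rough sketch (not the PR's actual code) of what collapsing one candidate into one Generation could look like. AssistantMessage, Generation and ToolCall are Spring AI types, Candidate and Part come from com.google.cloud.vertexai.api, and the helper name toGeneration is only illustrative:

import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

import com.google.cloud.vertexai.api.Candidate;
import com.google.cloud.vertexai.api.Part;
import com.google.protobuf.InvalidProtocolBufferException;
import com.google.protobuf.Struct;
import com.google.protobuf.util.JsonFormat;

import org.springframework.ai.chat.messages.AssistantMessage;
import org.springframework.ai.chat.model.Generation;

class MixedPartsSketch {

    // One candidate -> one Generation: join all text parts into a single string
    // and attach every FunctionCall part as a tool call on the same AssistantMessage.
    static Generation toGeneration(Candidate candidate) {
        List<Part> parts = candidate.getContent().getPartsList();

        String text = parts.stream()
            .filter(part -> part.hasText() && !part.getText().isEmpty())
            .map(Part::getText)
            .collect(Collectors.joining(" "));

        List<AssistantMessage.ToolCall> toolCalls = parts.stream()
            .filter(Part::hasFunctionCall)
            .map(part -> new AssistantMessage.ToolCall(
                "",                                    // Gemini does not return a call id
                "function",
                part.getFunctionCall().getName(),
                structToJson(part.getFunctionCall().getArgs())))
            .collect(Collectors.toList());

        return new Generation(new AssistantMessage(text, Map.of(), toolCalls));
    }

    private static String structToJson(Struct args) {
        try {
            return JsonFormat.printer().print(args);
        }
        catch (InvalidProtocolBufferException e) {
            throw new IllegalStateException(e);
        }
    }

}

The real implementation may differ in details (metadata handling, empty-text candidates); this is only meant to illustrate the one-candidate-to-one-Generation mapping.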

@q-nathangrand (Author) commented:

Just testing this further manually

@q-nathangrand (Author) commented:

> Just testing this further manually

Seems to be working well

@ericbottard self-assigned this Oct 13, 2025
@ericbottard self-requested a review October 13, 2025 15:28
@ericbottard (Member) commented:

Hi @q-nathangrand, thanks for taking the time to submit a PR for this issue.

Could you provide a test case that exhibits the initial problem (i.e. a response that contains a mixture of FunctionCall and non-FunctionCall parts)? Either a prompt that triggers such a response or, even better, an integration test.

Also, could you confirm that this issue could arise in both the streaming and non-streaming cases?

@q-nathangrand (Author) commented Oct 13, 2025:

> Hi @q-nathangrand, thanks for taking the time to submit a PR for this issue.
>
> Could you provide a test case that exhibits the initial problem (i.e. a response that contains a mixture of FunctionCall and non-FunctionCall parts)? Either a prompt that triggers such a response or, even better, an integration test.
>
> Also, could you confirm that this issue could arise in both the streaming and non-streaming cases?

Hey,

I've only been testing it in the non-streaming version, but responseCandidateToGeneration is used by both the streaming and non-streaming code paths.

Integration-test-wise, I'm not sure if I'll get time this week.

But the general idea is to encourage Gemini to explain why it is calling tools as it does.

Here's a snippet (sorry for the Scala/Java mix); the Scala driver comes first, followed by the Java TestTools class it uses:

import com.google.cloud.vertexai.VertexAI
import org.springframework.ai.chat.messages.{SystemMessage, UserMessage}
import org.springframework.ai.chat.prompt.Prompt
import org.springframework.ai.model.tool.DefaultToolCallingManager
import org.springframework.ai.support.ToolCallbacks
import org.springframework.ai.vertexai.gemini.schema.VertexToolCallingManager
import org.springframework.ai.vertexai.gemini.{VertexAiGeminiChatModel, VertexAiGeminiChatOptions}

import scala.collection.JavaConverters._

object Test extends App {

  private val vertexAi = new VertexAI.Builder()
    .setProjectId("your-project-id")
    .setLocation("us-east5")
    .build()

  private val toolCallbacks = ToolCallbacks.from(new TestTools())
  private val toolNames = toolCallbacks.map(_.getToolDefinition.name()).toSet.asJava

  private val options = VertexAiGeminiChatOptions
    .builder()
    .internalToolExecutionEnabled(true)
    .temperature(0.0)
    .model("gemini-2.5-pro")
    .toolCallbacks(toolCallbacks :_*)
    .toolNames(toolNames)
    .build()

  private val model = VertexAiGeminiChatModel
    .builder()
    .vertexAI(vertexAi)
    .defaultOptions(options)
    .toolCallingManager(new VertexToolCallingManager(DefaultToolCallingManager.builder().build()))
    .build()

  private val response = model.call(
    Prompt
      .builder()
      .messages(
        SystemMessage.builder().text(
          "You MUST include reasoning when you issue tool calls."
        ).build,
        UserMessage.builder().text("Set an alarm for an hour from now, and tell me what time that was for").build
      )
      .build()
  )

  println(response.getResult.getOutput.getText)

}
import org.springframework.ai.tool.annotation.Tool;
import org.springframework.context.i18n.LocaleContextHolder;

import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

public class TestTools {

    @Tool(description = "Get the current date and time in the user's timezone")
    String getCurrentDateTime() {
        System.out.println("----- Tool call: getCurrentDateTime begin -----");
        String localDateTime = LocalDateTime.now().atZone(LocaleContextHolder.getTimeZone().toZoneId()).toString();
        System.out.println("----- Tool call: getCurrentDateTime end -----");
        return localDateTime;
    }

    @Tool(description = "Set a user alarm for the given time, provided in ISO-8601 format")
    void setAlarm(String time) {
        System.out.println("----- Tool call: setAlarm being -----");
        LocalDateTime alarmTime = LocalDateTime.parse(time, DateTimeFormatter.ISO_DATE_TIME);
        System.out.println("----- Tool call: setAlarm end -----");
    }

}

Here's a couple of grabs from debugger showing the issue

(Screenshots: BrokenGenerations, MixedParts)
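
In case it helps with the test, here is a rough sketch of a fixture that reproduces the mixed-parts shape directly: a Candidate whose content carries both a text part and a FunctionCall part, as in the MixedParts grab. The builder calls are from com.google.cloud.vertexai.api; the resulting candidate could then be fed into responseCandidateToGeneration (whatever its exact visibility and signature) to assert that both the text and the tool call survive in a single Generation:

import com.google.cloud.vertexai.api.Candidate;
import com.google.cloud.vertexai.api.Content;
import com.google.cloud.vertexai.api.FunctionCall;
import com.google.cloud.vertexai.api.Part;
import com.google.protobuf.Struct;

class MixedPartsFixture {

    // A candidate whose single content mixes free text with a function call,
    // mirroring the "MixedParts" debugger grab above.
    static Candidate mixedPartsCandidate() {
        Part text = Part.newBuilder()
            .setText("I'll look up the current time, then set the alarm.")
            .build();

        Part functionCall = Part.newBuilder()
            .setFunctionCall(FunctionCall.newBuilder()
                .setName("getCurrentDateTime")
                .setArgs(Struct.newBuilder().build()))
            .build();

        return Candidate.newBuilder()
            .setContent(Content.newBuilder()
                .setRole("model")
                .addParts(text)
                .addParts(functionCall))
            .build();
    }

}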

@ericbottard (Member) commented:

Thanks for the extensive response, that should help us reproduce the issue and create an appropriate test!

@ericbottard (Member) commented:

Merged to main as 8e8654e
