
Conversation

@darshankparmar
Contributor

darshankparmar commented Dec 10, 2025

No description provided.

@CLAassistant

CLAassistant commented Dec 10, 2025

CLA assistant check
All committers have signed the CLA.

"command-light-nightly",
]

EmbeddingModels = Literal[
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think that embedding models can be used in voice calls

@Hormold
Contributor

Hormold commented Dec 11, 2025

Hello, and thank you for your PR!
In most cases, if the API is quite similar to OpenAI's, we add new providers as an additional function in the OpenAI plugin (e.g., the recent PRs for OpenRouter or OVH). Could you update it to be part of the openai plugin?

@darshankparmar
Contributor Author

> Hello, and thank you for your PR! In most cases, if the API is quite similar to OpenAI's, we add new providers as an additional function in the OpenAI plugin (e.g., the recent PRs for OpenRouter or OVH). Could you update it to be part of the openai plugin?

Hi @Hormold, thanks for your message! I will. 😄

darshankparmar marked this pull request as draft December 11, 2025 05:13
darshankparmar marked this pull request as ready for review December 11, 2025 17:33
Hormold self-assigned this Dec 11, 2025
@Hormold
Contributor

Hormold commented Dec 11, 2025

Hey! Tested the Cohere integration and found two issues that need fixing before this can work properly with voice agents.

First, the Cohere API returns `400: message must be at least 1 token long` when there's no user message in the chat context. This happens in voice agents when generate_reply(instructions="...") is called without user input (for example in on_enter() to greet the user). OpenAI handles this fine, but Cohere doesn't; it looks like Cohere requires at least one user message to generate a response.

Second, tool calling breaks with `400: schema 'type' must be a string. Array 'type' is unsupported for this model`. Cohere's OpenAI-compatible API seems to have stricter JSON schema requirements than OpenAI: it doesn't accept the union/array types that LiveKit generates for function tools. We need to figure out what schema format Cohere actually expects and adapt the tool serialization.
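
For reference, a minimal sketch of the schema shape that seems to be involved, assuming the strict tool schemas follow OpenAI's structured-output convention of encoding optional fields as type unions (this is an assumption based on the error text, not confirmed against the plugin code):

```python
# Assumed shape of a strict-mode tool schema for an optional parameter.
# The union "type" is what Cohere's compat API appears to reject with
# "schema 'type' must be a string".
strict_style_schema = {
    "type": "object",
    "properties": {
        "location": {"type": ["string", "null"]},  # union type -> rejected
    },
    "required": ["location"],
    "additionalProperties": False,
}

# A shape Cohere should accept: a single string "type", with optionality
# expressed by leaving the field out of "required" instead.
plain_schema = {
    "type": "object",
    "properties": {
        "location": {"type": "string"},
    },
    "required": [],
}
```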

Hormold marked this pull request as draft December 11, 2025 21:27
@darshankparmar
Contributor Author

Thanks for the feedback!

@darshankparmar
Contributor Author

I took a look at the first issue. One idea: we could add a check in the chat method (Cohere-only) to see if there’s at least one user message in the context. If not, we either auto-inject a small placeholder user message (e.g. “Hello”) so Cohere is happy, or just throw an exception instead. Not sure which direction makes more sense here, but this would at least guarantee we don’t hit that 400 on empty contexts.
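
Roughly what I have in mind (a sketch only; the exact ChatContext API and where this hook lives may differ):

```python
from livekit.agents import llm


def _ensure_user_message(chat_ctx: llm.ChatContext) -> None:
    """Cohere-only guard: the compat API rejects requests with no user message,
    so inject a tiny placeholder (or raise) before sending the request.
    Sketch only -- attribute/method names are assumptions."""
    has_user_message = any(
        isinstance(item, llm.ChatMessage) and item.role == "user"
        for item in chat_ctx.items
    )
    if not has_user_message:
        # Alternative: raise an error here instead of injecting a placeholder.
        chat_ctx.add_message(role="user", content="Hello")
```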

@darshankparmar
Contributor Author

I dug into the second issue as well. Ended up fixing it by setting `_strict_tool_schema=False` for Cohere.
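
In code, the gist is roughly this (simplified sketch; the factory name, base URL, and wiring shown here are illustrative, not the actual PR code):

```python
import os

from livekit.plugins import openai

# Assumed Cohere OpenAI-compatibility endpoint; illustrative only.
COHERE_BASE_URL = "https://api.cohere.ai/compatibility/v1"


def cohere_llm(model: str = "command-r-08-2024") -> openai.LLM:
    llm_instance = openai.LLM(
        model=model,
        api_key=os.environ["COHERE_API_KEY"],
        base_url=COHERE_BASE_URL,
    )
    # Cohere rejects strict-mode tool schemas (union "type" values),
    # so fall back to plain JSON schema serialization for tools.
    llm_instance._strict_tool_schema = False
    return llm_instance
```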

@darshankparmar
Contributor Author

Re the first issue: @Hormold, can you confirm which direction you'd prefer, auto-injecting a small placeholder user message (e.g. "Hello") when the context has no user message, or raising an exception instead?

@chenghao-mou
Member

Adding a placeholder message should be fine. This is what we did with Gemini Realtime:
[screenshot of the Gemini Realtime handling]

darshankparmar marked this pull request as ready for review December 21, 2025 03:11
@darshankparmar
Contributor Author

@Hormold I've pushed the fixes for the Cohere issues; the PR is ready for review.

@Hormold
Contributor

Hormold commented Dec 22, 2025

Hey, I tested, and the PR looks good. One minor thing: there's a merge conflict here. Also, I encountered a couple of timeouts on Cohere responses.

@Hormold
Contributor

Hormold commented Dec 22, 2025

The merge broke the imports. Could you please add ChatMessage back to the imports?

@darshankparmar
Contributor Author

> Hey, I tested, and the PR looks good. One minor thing: there's a merge conflict here. Also, I encountered a couple of timeouts on Cohere responses.

Thanks for testing! Regarding the timeouts: the Cohere API can have high latency (25+ seconds TTFT), which may cause timeout errors in real-time applications. Consider:

  1. For a small, fast model: command-r7b-12-2024
  2. For general-purpose use: command-r-08-2024
  3. For more advanced capabilities: command-a-03-2025
  4. Increasing timeout values for production use (see the sketch below)
  5. Setting an appropriate max_completion_tokens to reduce response time

I also updated the model list to the latest Cohere Command (text generation) models.
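
For items 4 and 5, a rough sketch of what that could look like, assuming the plugin ends up exposing a with_cohere() factory like the other OpenAI-compatible providers and that it forwards timeout and max_completion_tokens (the factory name and parameters here are assumptions, not the confirmed API):

```python
import httpx

from livekit.plugins import openai

# Sketch only: `with_cohere` and its parameters are assumed, not confirmed.
cohere_llm = openai.LLM.with_cohere(
    model="command-r7b-12-2024",   # small, fast model to keep TTFT down
    max_completion_tokens=256,     # cap response length to reduce latency
    timeout=httpx.Timeout(60.0),   # leave headroom for slow Cohere responses
)
```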

@Hormold
Contributor

Hormold commented Dec 23, 2025

Thanks! Works great!

@darshankparmar
Contributor Author

> Thanks! Works great!

Thanks for the feedback! Glad it's working well.
Any final changes needed, or is it ready to merge? 🚀
