You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Update llms.mdx (Gemini 2.0)
- Add Gemini 2.0 flash to Gemini table.
- Add link to 2 hosting paths for Gemini in Tip.
- Change to lower case model slugs vs names, user convenience.
- Add https://artificialanalysis.ai/ as alternate leaderboard.
- Move Gemma to "other" tab.
* Update llm.py (gemini 2.0)
Add setting for Gemini 2.0 context window to llm.py
---------
Co-authored-by: Brandon Hancock (bhancock_ai) <109994880+bhancockio@users.noreply.github.com>
Copy file name to clipboardexpand all lines: docs/concepts/llms.mdx
+11-6
Original file line number
Diff line number
Diff line change
@@ -29,7 +29,7 @@ Large Language Models (LLMs) are the core intelligence behind CrewAI agents. The
29
29
30
30
## Available Models and Their Capabilities
31
31
32
-
Here's a detailed breakdown of supported models and their capabilities, you can compare performance at [lmarena.ai](https://lmarena.ai/):
32
+
Here's a detailed breakdown of supported models and their capabilities, you can compare performance at [lmarena.ai](https://lmarena.ai/?leaderboard) and [artificialanalysis.ai](https://artificialanalysis.ai/):
33
33
34
34
<Tabs>
35
35
<Tabtitle="OpenAI">
@@ -121,12 +121,18 @@ Here's a detailed breakdown of supported models and their capabilities, you can
121
121
<Tabtitle="Gemini">
122
122
| Model | Context Window | Best For |
123
123
|-------|---------------|-----------|
124
-
| Gemini 1.5 Flash | 1M tokens | Balanced multimodal model, good for most tasks |
125
-
| Gemini 1.5 Flash 8B | 1M tokens | Fastest, most cost-efficient, good for high-frequency tasks |
126
-
| Gemini 1.5 Pro | 2M tokens | Best performing, wide variety of reasoning tasks including logical reasoning, coding, and creative collaboration |
124
+
| gemini-2.0-flash-exp | 1M tokens | Higher quality at faster speed, multimodal model, good for most tasks |
125
+
| gemini-1.5-flash | 1M tokens | Balanced multimodal model, good for most tasks |
126
+
| gemini-1.5-flash-8B | 1M tokens | Fastest, most cost-efficient, good for high-frequency tasks |
127
+
| gemini-1.5-pro | 2M tokens | Best performing, wide variety of reasoning tasks including logical reasoning, coding, and creative collaboration |
127
128
128
129
<Tip>
129
130
Google's Gemini models are all multimodal, supporting audio, images, video and text, supporting context caching, json schema, function calling, etc.
131
+
132
+
These models are available via API_KEY from
133
+
[The Gemini API](https://ai.google.dev/gemini-api/docs) and also from
134
+
[Google Cloud Vertex](https://cloud.google.com/vertex-ai/generative-ai/docs/migrate/migrate-google-ai) as part of the
0 commit comments