Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

💄 style: update groq model list #4195

Merged
merged 3 commits into from
Sep 29, 2024
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
41 changes: 33 additions & 8 deletions src/config/modelProviders/groq.ts
Original file line number Diff line number Diff line change
Expand Up @@ -3,11 +3,39 @@ import { ModelProviderCard } from '@/types/llm';
// ref https://console.groq.com/docs/tool-use
const Groq: ModelProviderCard = {
chatModels: [
// TODO: During preview launch, Groq is limiting 3.1 models to max_tokens of 8k.
// TODO: During preview launch, Groq is limiting 3.2 models to max_tokens of 8k.
{
description:
'Llama 3.2 旨在处理结合视觉和文本数据的任务。它在图像描述和视觉问答等任务中表现出色,跨越了语言生成和视觉推理之间的鸿沟。',
displayName: 'Llama 3.2 11B Vision (Preview)',
arvinxx marked this conversation as resolved.
Show resolved Hide resolved
enabled: true,
id: 'llama-3.2-11b-vision-preview',
maxOutput: 8192,
pricing: {
input: 0.05,
output: 0.08,
},
tokens: 8192,
vision: true,
},
{
description:
'Llama 3.2 旨在处理结合视觉和文本数据的任务。它在图像描述和视觉问答等任务中表现出色,跨越了语言生成和视觉推理之间的鸿沟。',
displayName: 'Llama 3.2 90B Vision (Preview)',
enabled: true,
id: 'llama-3.2-90b-vision-preview',
maxOutput: 8192,
pricing: {
input: 0.59,
output: 0.79,
},
tokens: 8192,
vision: true,
},
{
description:
'Llama 3.1 8B 是一款高效能模型,提供了快速的文本生成能力,非常适合需要大规模效率和成本效益的应用场景。',
displayName: 'Llama 3.1 8B (Preview)',
displayName: 'Llama 3.1 8B',
enabled: true,
functionCall: true,
id: 'llama-3.1-8b-instant',
Expand All @@ -21,7 +49,7 @@ const Groq: ModelProviderCard = {
{
description:
'Llama 3.1 70B 提供更强大的AI推理能力,适合复杂应用,支持超多的计算处理并保证高效和准确率。',
displayName: 'Llama 3.1 70B (Preview)',
displayName: 'Llama 3.1 70B',
enabled: true,
functionCall: true,
id: 'llama-3.1-70b-versatile',
Expand All @@ -35,7 +63,7 @@ const Groq: ModelProviderCard = {
/*
// Offline due to overwhelming demand! Stay tuned for updates.
{
displayName: 'Llama 3.1 405B (Preview)',
displayName: 'Llama 3.1 405B',
functionCall: true,
id: 'llama-3.1-405b-reasoning',
tokens: 8_192,
Expand All @@ -44,7 +72,6 @@ const Groq: ModelProviderCard = {
{
description: 'Llama 3 Groq 8B Tool Use 是针对高效工具使用优化的模型,支持快速并行计算。',
displayName: 'Llama 3 Groq 8B Tool Use (Preview)',
enabled: true,
functionCall: true,
id: 'llama3-groq-8b-8192-tool-use-preview',
pricing: {
Expand All @@ -56,7 +83,6 @@ const Groq: ModelProviderCard = {
{
description: 'Llama 3 Groq 70B Tool Use 提供强大的工具调用能力,支持复杂任务的高效处理。',
displayName: 'Llama 3 Groq 70B Tool Use (Preview)',
enabled: true,
functionCall: true,
id: 'llama3-groq-70b-8192-tool-use-preview',
pricing: {
Expand Down Expand Up @@ -124,13 +150,12 @@ const Groq: ModelProviderCard = {
{
arvinxx marked this conversation as resolved.
Show resolved Hide resolved
description: 'LLaVA 1.5 7B 提供视觉处理能力融合,通过视觉信息输入生成复杂输出。',
displayName: 'LLaVA 1.5 7B',
enabled: true,
id: 'llava-v1.5-7b-4096-preview',
tokens: 4096,
vision: true,
},
],
checkModel: 'gemma2-9b-it',
checkModel: 'llama-3.1-8b-instant',
description:
'Groq 的 LPU 推理引擎在最新的独立大语言模型(LLM)基准测试中表现卓越,以其惊人的速度和效率重新定义了 AI 解决方案的标准。Groq 是一种即时推理速度的代表,在基于云的部署中展现了良好的性能。',
id: 'groq',
Expand Down
Loading