Extract model costs into log and CSVs #210

Closed
2 tasks done
bauersimon opened this issue Jun 24, 2024 · 1 comment
Member

bauersimon commented Jun 24, 2024

  • Extract model costs (per 1M request tokens, per 1M response tokens, per request, per response) and write them into the CSV reports
  • Check other API providers too, e.g. Mistral, Fireworks, Groq, ... (@bauersimon knows about these)
bauersimon added the enhancement label Jun 24, 2024
bauersimon added this to the v0.6.0 milestone Jun 24, 2024
zimmski changed the title from "Extract model costs from openrouter.ai" to "Extract model costs into log and CSVs" Jun 24, 2024
ruiAzevedo19 added a commit that referenced this issue Jun 25, 2024
…mation can be extracted from the API response

Part of #210
ruiAzevedo19 added a commit that referenced this issue Jun 25, 2024
ruiAzevedo19 added a commit that referenced this issue Jun 25, 2024
ruiAzevedo19 added a commit that referenced this issue Jun 25, 2024
ruiAzevedo19 added a commit that referenced this issue Jun 25, 2024
ruiAzevedo19 added a commit that referenced this issue Jun 26, 2024
… can be extracted from the API response

Part of #210
ruiAzevedo19 added a commit that referenced this issue Jun 26, 2024
Contributor

ruiAzevedo19 commented Jun 27, 2024

Groq Cloud

  • source: https://console.groq.com/docs/models
  • Only supports the following models (pricing: per 1M tokens, input/output):
    • LLaMA3 8b: $0.05/$0.08
    • LLaMA3 70b: $0.59/$0.79
    • Mixtral 8x7b: $0.24/$0.24
    • Gemma 7b: $0.07/$0.07
    • Whisper: $0.03/hour transcribed
    • Note: none of these models is in our top 10

Fireworks

  • source: https://fireworks.ai/pricing
  • Pricing (per 1M tokens, same price for input and output tokens):
    • 0B - 16B: $0.20
    • 16.1B - 80B: $0.90
    • MoE 0B - 56B (e.g. Mixtral 8x7B): $0.50
    • MoE 56.1B - 176B (e.g. DBRX, Mixtral 8x22B): $1.20
    • Yi Large: $3.00
    • Note: only one model is in our top 10: llama-3-70b-instruct
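
Since Fireworks prices by model size rather than per model, a lookup could be sketched as a tier function. This is only an illustration of the tiers listed above: the function name and parameters are hypothetical, the caller is assumed to pass the parameter count the way the provider counts it (e.g. 56 for Mixtral 8x7B), and individually priced models such as Yi Large would need a separate special case.

```python
def fireworks_price_per_million(parameters_billion: float, mixture_of_experts: bool = False) -> float:
    """Return USD per 1M tokens (input and output priced alike) for a size tier.

    Hypothetical helper mirroring the tiers listed above. Individually priced
    models (e.g. Yi Large at $3.00) are not covered here.
    """
    if mixture_of_experts:
        if parameters_billion <= 56:
            return 0.50
        if parameters_billion <= 176:
            return 1.20
    else:
        if parameters_billion <= 16:
            return 0.20
        if parameters_billion <= 80:
            return 0.90
    # Sizes beyond the listed tiers have no known price.
    raise ValueError(f"no listed tier for {parameters_billion}B parameters")
```

For example, `fireworks_price_per_million(70)` falls into the 16.1B - 80B tier, which is the tier llama-3-70b-instruct would use.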

Mistral

  • Only supports the following models (pricing: per 1M tokens, input/output):
    • open-mistral-7b: $0.25/$0.25
    • open-mixtral-8x7b: $0.7/$0.7
    • open-mixtral-8x22b: $2/$6
    • mistral-small-2402: $1/$3
    • codestral-2405: $1/$3
    • mistral-medium-2312: $2.7/$8.1
    • mistral-large-2402: $4/$12
    • mistral-embed: $0.1/$0.1
    • Mistral 7B: $0.75/$0.75
    • Mistral Small: $2.5/$7.5
    • Note: none of these models is in our top 10

Notes

  • The Groq and Mistral JSON schemas contain no pricing information for models
    • All the APIs require an API key, even to list the models
  • Fireworks is also missing pricing information for models
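
Because none of these APIs return pricing, one possible fallback is a hand-maintained table keyed by model identifier. The sketch below uses a few of the Mistral prices listed above; the table name, the `lookup_price` helper and the choice to return `None` for unknown models are assumptions for illustration, and such a table would need manual updates whenever a provider changes its prices.

```python
from typing import Dict, Optional, Tuple

# USD per 1M tokens as (input, output), taken from the Mistral list above.
# Hypothetical static table: it must be updated by hand when prices change.
STATIC_PRICES: Dict[str, Tuple[float, float]] = {
    "open-mistral-7b": (0.25, 0.25),
    "open-mixtral-8x7b": (0.7, 0.7),
    "open-mixtral-8x22b": (2.0, 6.0),
    "codestral-2405": (1.0, 3.0),
    "mistral-large-2402": (4.0, 12.0),
}


def lookup_price(model_id: str) -> Optional[Tuple[float, float]]:
    """Return (input, output) USD per 1M tokens, or None if the model is not listed."""
    return STATIC_PRICES.get(model_id)
```

Returning `None` instead of a default of zero keeps "price unknown" distinguishable from "free" in the reports.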

Munsio pushed a commit that referenced this issue Jun 27, 2024
… can be extracted from the API response

Part of #210
Munsio pushed a commit that referenced this issue Jun 27, 2024