Extract model costs into log and CSVs, so the pricing information is always available #216

ruiAzevedo19 · 2024-06-25T15:28:15Z

Part of #210

provider/openrouter/openrouter.go

model/model.go

provider/openrouter/openrouter.go

evaluate/report/csv.go

cmd/eval-dev-quality/cmd/evaluate.go

cmd/eval-dev-quality/cmd/evaluate_test.go

bauersimon · 2024-06-26T07:02:32Z

Please try this out with the cheapest model from openrouter and post the CSV here to see how it looks like.

provider/openrouter/openrouter.go

… can be extracted from the API response Part of #210

…always available Part of #210

ruiAzevedo19 · 2024-06-26T09:45:44Z

@bauersimon These are the results

Command: eval-dev-quality evaluate --runs 1 --repository golang/plain --model openrouter/meta-llama/llama-3-8b-instruct
Model info: https://openrouter.ai/models/meta-llama/llama-3-8b-instruct
Current price:
- Prompt: $0.07/M input tokens = $0.00000007/input token
- Completion: $0.07/M output tokens = $0.00000007/output token
- Total cost should be: $0.00000014

evaluation.csv

model,cost,language,repository,task,score,coverage,files-executed,generate-tests-for-file-character-count,processing-time,response-character-count,response-no-error,response-no-excess,response-with-code
openrouter/meta-llama/llama-3-8b-instruct,0.00000014,golang,golang/plain,write-tests,1,0,0,87,1186,90,1,0,0

evaluation.log

(...)
2024/06/26 10:35:47 Evaluation score for "openrouter/meta-llama/llama-3-8b-instruct" ("response-no-code"): cost=0.00, score=1, coverage=0, files-executed=0, generate-tests-for-file-character-count=87, processing-time=1186, response-character-count=90, response-no-error=1, response-no-excess=0, response-with-code=0

golang-summed.csv

model,cost,score,coverage,files-executed,generate-tests-for-file-character-count,processing-time,response-character-count,response-no-error,response-no-excess,response-with-code
openrouter/meta-llama/llama-3-8b-instruct,0.00000014,1,0,0,87,1186,90,1,0,0

models-summed.csv

model,cost,score,coverage,files-executed,generate-tests-for-file-character-count,processing-time,response-character-count,response-no-error,response-no-excess,response-with-code
openrouter/meta-llama/llama-3-8b-instruct,0.00000014,1,0,0,87,1186,90,1,0,0

bauersimon · 2024-06-26T09:47:35Z

Awesome. The cost in the log is kinda useless but it should be higher for more expensive models anyways. Just need to remember to scale them up for our evaluations then.

ruiAzevedo19 force-pushed the 210-model-costs branch from 3694f8b to dd0a858 Compare June 25, 2024 15:39

ruiAzevedo19 requested a review from bauersimon June 25, 2024 15:43

ruiAzevedo19 force-pushed the 210-model-costs branch 2 times, most recently from 12ac8e9 to de43bdb Compare June 25, 2024 16:01

ruiAzevedo19 self-assigned this Jun 25, 2024

ruiAzevedo19 added the enhancement New feature or request label Jun 25, 2024

ruiAzevedo19 added this to the v0.6.0 milestone Jun 25, 2024

ruiAzevedo19 mentioned this pull request Jun 25, 2024

Extract model names, to obtain a human-readable name for each model #217

Merged

bauersimon requested changes Jun 26, 2024

View reviewed changes

ruiAzevedo19 commented Jun 26, 2024

View reviewed changes

provider/openrouter/openrouter.go Outdated Show resolved Hide resolved

ruiAzevedo19 added 2 commits June 26, 2024 10:08

Custom HTTP request to list openrouter models, so pricing information…

c8ca3a9

… can be extracted from the API response Part of #210

Extract model costs into log and CSVs, so the pricing information is …

27cb52c

…always available Part of #210

ruiAzevedo19 force-pushed the 210-model-costs branch from de43bdb to 27cb52c Compare June 26, 2024 09:09

ruiAzevedo19 requested a review from bauersimon June 26, 2024 09:09

bauersimon approved these changes Jun 26, 2024

View reviewed changes

bauersimon merged commit 0af4eab into main Jun 26, 2024
4 checks passed

bauersimon deleted the 210-model-costs branch June 26, 2024 09:47

bauersimon mentioned this pull request Jul 31, 2024

Roadmap for v0.6.0 #195

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract model costs into log and CSVs, so the pricing information is always available #216

Extract model costs into log and CSVs, so the pricing information is always available #216

ruiAzevedo19 commented Jun 25, 2024

bauersimon commented Jun 26, 2024

ruiAzevedo19 commented Jun 26, 2024

bauersimon commented Jun 26, 2024

Extract model costs into log and CSVs, so the pricing information is always available #216

Extract model costs into log and CSVs, so the pricing information is always available #216

Conversation

ruiAzevedo19 commented Jun 25, 2024

bauersimon commented Jun 26, 2024

ruiAzevedo19 commented Jun 26, 2024

bauersimon commented Jun 26, 2024