You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix: remove the alpha suffix in run_benchmark.py (#179)
## What does this PR do?
Remove the alpha suffix of evaluate_rows after the new pkg release
## Test Plan
test with `llama-stack-client --endpoint xxx eval run-benchmark
"meta-reference-mmlu-cot" --model-id "meta-llama/Llama-3.1-8B-Instruct"
--output-dir "/home/markchen1015/" --num-examples 5` and the eval
finished successfully

0 commit comments