Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with azure benchmarking tool on passing input tokens greater than 30k #54

Open
rajm3180 opened this issue Jul 29, 2024 · 1 comment

Comments

@rajm3180
Copy link

We are trying to run our benchmarking exercise using the benchmarking tool for gpt4o but getting below error on passing the input context greater than 30k from the command line

Command:
python -m benchmark.bench load --temperature 0.0 --shape-profile custom --deployment 'deployment name' --max-tokens 200 --context-tokens 35000 --api-version 2024-02-01 --rate 10 --duration 600 https://genai-stg-westus3-1.openai.azure.com/

Error:
getting error as "There aren't enough words to choose from. Cannot generate 15991 word(s)

@michaeltremeer
Copy link
Contributor

Hey @rajm3180 , this repo is no longer maintained and it is suggested to use the improved fork here: https://aka.ms/ImprovedAOAIBenchmarkingTool.

The fork has already fixed this issue and it is actively maintained, with a large number of fixes and improvements compared to this repo. Make sure to read the README to understand all of the updates that exist.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants