It would be nice to show the time taken to generate the response, in seconds, next to the tokens/s figure. Together, the two numbers would make a good benchmark for checking how fast a model generates.
Now: 14.50 tokens/s
Upgrade: 14.50 tokens/s • 10s
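To make the requested format concrete, here is a minimal sketch of how such a label could be built from a token count and an elapsed time. The function name and its parameters are hypothetical and not taken from the project's code; Python is used only for brevity.

```python
def format_generation_stats(num_tokens: int, elapsed_seconds: float) -> str:
    """Build a label like '14.50 tokens/s • 10s' (hypothetical helper)."""
    if elapsed_seconds <= 0:
        # Avoid dividing by zero when no time has been measured yet.
        raise ValueError("elapsed_seconds must be positive")
    tokens_per_second = num_tokens / elapsed_seconds
    # Rate with two decimals, elapsed time rounded to whole seconds.
    return f"{tokens_per_second:.2f} tokens/s • {elapsed_seconds:.0f}s"


# 145 tokens generated in 10 seconds
print(format_generation_stats(145, 10.0))  # prints "14.50 tokens/s • 10s"
```

Showing both values lets you compare runs of different lengths: the rate alone does not say how long you actually waited for the response.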
ui: show time taken to generate response #7
e7beb32
ffb93a8
shubham0204