feat: Add automatic nightly benchmarks #2591

Hugoch · 2024-09-30T15:59:04Z

What does this PR do?

This PR adds automated load tests in CI using custom benchmarkedtool.

Tests are performed using a Constant arrival rate load test: It simulates a constant rate of user requests arrival, independent of the system’s response rate, during 120 seconds.

The test is run for using a sample of ShareGPT.

Test compute the following metrics:

Inter token latency: Time to generate a new output token for each user querying the system. It translates as the “speed” perceived by the end-user. We aim for at least 300 words per minute (average reading speed), so ITL<150ms
Time to First Token: Time the user has to wait before seeing the first token of its answer. Lower waiting time are essential for real-time interactions, less so for offline workloads.
End-to-end latency: The overall time the system took to generate the full response to the user.
Throughput: The number of tokens per second the system can generate across all requests
Successful requests: The number of requests the system was able to honor in the benchmark timeframe
Error rate: The percentage of requests that ended up in error, as the system could not process them in time or failed to process them.

At the end of the test, it produces a dashboard with the results and plots.

Results are added to https://huggingface.co/spaces/huggingface/tgi-benchmarks

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

feat: Add automatic nightly benchmarks

fc7dcb0

Hugoch self-assigned this Sep 30, 2024

Hugoch force-pushed the feat/ci-benchmarks branch 25 times, most recently from 22db246 to beb1cf1 Compare October 1, 2024 13:41

fix: Update runners group

2980720

Hugoch force-pushed the feat/ci-benchmarks branch from beb1cf1 to 2980720 Compare October 1, 2024 15:58

fix: add created_at field to results

d30266d

fix: Add variable results file location

6ae0467

Hugoch marked this pull request as ready for review October 9, 2024 14:47

Hugoch requested review from OlivierDehaene and Narsil October 9, 2024 14:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add automatic nightly benchmarks #2591

feat: Add automatic nightly benchmarks #2591

Hugoch commented Sep 30, 2024

feat: Add automatic nightly benchmarks #2591

Are you sure you want to change the base?

feat: Add automatic nightly benchmarks #2591

Conversation

Hugoch commented Sep 30, 2024

What does this PR do?

Before submitting

Who can review?