Commit 62a33c2
[Serve.LLM] Add avg prompt length metric (#58599)
## Description
Add avg prompt length metric
When using uniform prompt length (especially in testing), the P50 and
P90 computations are skewed due to the 1_2_5 buckets used in vLLM.
Average prompt length provides another useful dimension to look at and
validate.
For example, using uniformly ISL=5000, P50 shows 7200 and P90 shows
9400, and avg accurately shows 5000.
<img width="1186" height="466" alt="image"
src="https://github.com/user-attachments/assets/4615c3ca-2e15-4236-97f9-72bc63ef9d1a"
/>
## Related issues
## Additional information
---------
Signed-off-by: Rui Qiao <ruisearch42@gmail.com>
Signed-off-by: Rui Qiao <161574667+ruisearch42@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>1 parent 0c4dcb0 commit 62a33c2
File tree
1 file changed
+4
-0
lines changed- python/ray/dashboard/modules/metrics/dashboards
1 file changed
+4
-0
lines changedLines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
223 | 223 | | |
224 | 224 | | |
225 | 225 | | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
226 | 230 | | |
227 | 231 | | |
228 | 232 | | |
| |||
0 commit comments