Skip to content

Commit 3f09c39

Browse files
authored
fix: fix #2653: links of h100_prefill_performance.png and h100_decode_performance.png (#2650)
1 parent a68c2f8 commit 3f09c39

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

docs/benchmarks/pre_deployment_profiling.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -23,8 +23,8 @@ The script will first detect the number of available GPUs on the current nodes (
2323

2424
After the profiling finishes, two plots will be generated in the `output-dir`. For example, here are the profiling results for `examples/llm/configs/disagg.yaml`:
2525

26-
![Prefill Performance](../images/h100_prefill_performance.png)
27-
![Decode Performance](../images/h100_decode_performance.png)
26+
![Prefill Performance](../../docs/images/h100_prefill_performance.png)
27+
![Decode Performance](../../docs/images/h100_decode_performance.png)
2828

2929
For the prefill performance, the script will plot the TTFT for different TP sizes and select the best TP size that meet the target TTFT SLA and delivers the best throughput per GPU. Based on how close the TTFT of the selected TP size is to the SLA, the script will also recommend the upper and lower bounds of the prefill queue size to be used in planner.
3030

0 commit comments

Comments
 (0)