We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent ee75e2c commit b95f5eaCopy full SHA for b95f5ea
docs/llama2.md.template
@@ -27,6 +27,7 @@
27
| [PyTorch Lightning](/bench_lightning/) | 24.85 ± 0.07 | 44.56 ± 2.89 | 10.50 ± 0.12 | 24.83 ± 0.05 |
28
| [Optimum Nvidia](/bench_optimum_nvidia/) | 110.36 ± 0.52| 109.09 ± 4.26 | - | - |
29
| [Nvidia TensorRT-LLM](/bench_tensorrtllm/) | 55.19 ± 1.03 | 85.03 ± 0.62 | 167.66 ± 2.05 | 235.18 ± 3.20 |
30
+
31
*(Data updated: `<LAST_UPDATE>`)
32
33
0 commit comments