[Feature] add kernel level benchmark #2402

zhyncs · 2024-12-08T11:06:34Z

Checklist

1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose instead. Otherwise, it will be closed.
2. Please use English, otherwise it will be closed.

Motivation

Use the Triton benchmark utilities (https://triton-lang.org/main/python-api/generated/triton.testing.do_bench.html#triton.testing.do_bench) to benchmark kernels from FlashInfer, Triton, vLLM, TensorRT-LLM, cuDNN, etc.
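A minimal sketch of how `triton.testing.do_bench` might be wrapped for this. The helper name `bench_ms` and the wall-clock fallback are assumptions made for illustration, so the snippet also runs on machines without Triton or a CUDA device; neither is part of sglang.

```python
import statistics
import time

try:
    import torch
    import triton.testing

    _HAS_GPU = torch.cuda.is_available()
except ImportError:
    _HAS_GPU = False


def bench_ms(fn, warmup=25, rep=100):
    """Return the runtime of fn() in milliseconds (hypothetical helper)."""
    if _HAS_GPU:
        # do_bench treats warmup and rep as time budgets in milliseconds,
        # not as iteration counts, and returns the mean runtime by default.
        return float(triton.testing.do_bench(fn, warmup=warmup, rep=rep))

    # Fallback for illustration only: a wall-clock median on the host.
    # It does not synchronize with a GPU, so it is unsuitable for real kernels.
    for _ in range(3):
        fn()
    samples = []
    for _ in range(20):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1e3)
    return statistics.median(samples)
```

On a GPU machine, a call could look like `bench_ms(lambda: torch.matmul(a, b))` with `a` and `b` already allocated on the device, so that allocation cost stays out of the measurement.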

Related resources

No response

zhyncs · 2024-12-08T11:11:02Z

Currently, the main branch already has some e2e benchmarks, but this is far from enough for us. We need kernel-level benchmarks that cover small to large batch sizes, various shapes, and data types. They should tell us where we stand compared to existing libraries and how much room for optimization remains.
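The sweep described above could be sketched roughly as follows. Everything here is hypothetical: the `sweep` and `wall_clock_ms` helpers, the `seq_len` axis standing in for "various shapes", and the toy providers. On a GPU, the `timer` argument would be something built on `triton.testing.do_bench` rather than the wall-clock stand-in used here.

```python
import itertools
import time


def sweep(providers, batch_sizes, seq_lens, dtypes, timer):
    """Time every provider on every (batch_size, seq_len, dtype) combination.

    providers maps a name to a factory; factory(batch_size, seq_len, dtype)
    returns a zero-argument callable that runs the kernel once.
    """
    rows = []
    for bs, seq_len, dtype in itertools.product(batch_sizes, seq_lens, dtypes):
        row = {"batch_size": bs, "seq_len": seq_len, "dtype": dtype}
        for name, factory in providers.items():
            row[name] = timer(factory(bs, seq_len, dtype))
        rows.append(row)
    return rows


def wall_clock_ms(fn, iters=10):
    """Stand-in timer: average wall-clock time of fn() in milliseconds."""
    fn()  # one warm-up call
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    return (time.perf_counter() - start) * 1e3 / iters


def make_toy_provider(work_factor):
    """Toy factory whose cost grows with batch_size * seq_len (not a real kernel)."""
    def factory(bs, seq_len, dtype):
        n = bs * seq_len * work_factor
        return lambda: sum(range(n))
    return factory


providers = {"baseline": make_toy_provider(2), "candidate": make_toy_provider(1)}
rows = sweep(providers, [1, 16], [128], ["float16", "bfloat16"], wall_clock_ms)
for row in rows:
    print(row)
```

Each row holds one timing per provider, so comparing a new kernel with an existing library on a given shape comes down to comparing two columns.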

zhyncs · 2024-12-15T05:52:47Z

ref #2486

zhyncs added the enhancement, good first issue, help wanted, high priority, and flashinfer labels Dec 8, 2024

zhyncs assigned ispobock and zhyncs Dec 8, 2024

bjmsong mentioned this issue Dec 9, 2024

decoding attention kernel benchmark #2425

Merged

3 tasks
