Files

.buildkite
.github
benchmarks
- cutlass_benchmarks
- disagg_benchmarks
- fused_kernels
- kernels
- overheads
- profiling
- structured_schemas
- P3L.py
- P3L_mling.py
- README.md
- backend_request_func.py
- benchmark_guided.py
- benchmark_latency.py
- benchmark_long_document_qa_throughput.py
- benchmark_prefix_caching.py
- benchmark_prioritization.py
- benchmark_serving.py
- benchmark_serving_guided.py
- benchmark_throughput.py
- benchmark_utils.py
- launch_tgi_server.sh
- sonnet.txt
cmake
csrc
docs
examples
gradlib
rocm_patch
tests
tools
vllm
.clang-format
.dockerignore
.gitignore
.pre-commit-config.yaml
.readthedocs.yaml
.shellcheckrc
.yapfignore
CMakeLists.txt
CODE_OF_CONDUCT.md
CONTRIBUTING.md
DCO
Dockerfile
Dockerfile.arm
Dockerfile.base_navi
Dockerfile.cpu
Dockerfile.hpu
Dockerfile.neuron
Dockerfile.openvino
Dockerfile.ppc64le
Dockerfile.rocm
Dockerfile.rocm_base
Dockerfile.tpu
Dockerfile.xpu
LICENSE
MANIFEST.in
README.md
ROCm_performance.md
SECURITY.md
collect_env.py
find_cuda_init.py
format.sh
pyproject.toml
python_only_dev.py
requirements-build.txt
requirements-common.txt
requirements-cpu.txt
requirements-cuda.txt
requirements-dev.txt
requirements-hpu.txt
requirements-lint.txt
requirements-neuron.txt
requirements-openvino.txt
requirements-rocm-build.txt
requirements-rocm.txt
requirements-test.in
requirements-test.txt
requirements-tpu.txt
requirements-xpu.txt
setup.py
setup_cython.py
use_existing_torch.py

benchmarks

gshtras

Merge remote-tracking branch 'upstream/main' into upstream_merge_25_0…

Feb 17, 2025

ce342c7 · Feb 17, 2025

History

This branch is 642 commits ahead of, 29 commits behind vllm-project/vllm:main.

Name	Name	Last commit message	Last commit date
parent directory ..
cutlass_benchmarks	cutlass_benchmarks	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
disagg_benchmarks	disagg_benchmarks	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
fused_kernels	fused_kernels	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
kernels	kernels	Merge remote-tracking branch 'upstream/main' into upstream_merge_25_0…	Feb 3, 2025
overheads	overheads	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
profiling	profiling	Lint	Feb 12, 2025
structured_schemas	structured_schemas	[Benchmark] Benchmark structured output with datasets (vllm-project#1…	Dec 4, 2024
P3L.py	P3L.py	Fixing the output formatting (#414 )	Feb 10, 2025
P3L_mling.py	P3L_mling.py	Fixing the output formatting (#414 )	Feb 10, 2025
README.md	README.md	[Benchmark] Add BurstGPT to benchmark_serving (vllm-project#13063 )	Feb 11, 2025
backend_request_func.py	backend_request_func.py	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
benchmark_guided.py	benchmark_guided.py	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
benchmark_latency.py	benchmark_latency.py	Run v1 benchmark and integrate with PyTorch OSS benchmark database (v…	Feb 17, 2025
benchmark_long_document_qa_throughput.py	benchmark_long_document_qa_throughput.py	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
benchmark_prefix_caching.py	benchmark_prefix_caching.py	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
benchmark_prioritization.py	benchmark_prioritization.py	[Misc] Add SPDX-License-Identifier headers to python source files (vl…	Feb 2, 2025
benchmark_serving.py	benchmark_serving.py	Run v1 benchmark and integrate with PyTorch OSS benchmark database (v…	Feb 17, 2025
benchmark_serving_guided.py	benchmark_serving_guided.py	Run v1 benchmark and integrate with PyTorch OSS benchmark database (v…	Feb 17, 2025
benchmark_throughput.py	benchmark_throughput.py	Run v1 benchmark and integrate with PyTorch OSS benchmark database (v…	Feb 17, 2025
benchmark_utils.py	benchmark_utils.py	Run v1 benchmark and integrate with PyTorch OSS benchmark database (v…	Feb 17, 2025
launch_tgi_server.sh	launch_tgi_server.sh	[CI/Build] Add shell script linting using shellcheck (vllm-project#7925 )	Nov 7, 2024
sonnet.txt	sonnet.txt	feat(benchmarks): Add Prefix Caching Benchmark to Serving Benchmark (v…	Mar 27, 2024

README.md

Benchmarking vLLM

Downloading the ShareGPT dataset

You can download the dataset by running:

wget https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/resolve/main/ShareGPT_V3_unfiltered_cleaned_split.json

Downloading the ShareGPT4V dataset

The json file refers to several image datasets (coco, llava, etc.). The benchmark scripts will ignore a datapoint if the referred image is missing.

wget https://huggingface.co/datasets/Lin-Chen/ShareGPT4V/resolve/main/sharegpt4v_instruct_gpt4-vision_cap100k.json
mkdir coco -p
wget http://images.cocodataset.org/zips/train2017.zip -O coco/train2017.zip
unzip coco/train2017.zip -d coco/

Downloading the BurstGPT dataset

You can download the BurstGPT v1.1 dataset by running:

wget https://github.com/HPMLL/BurstGPT/releases/download/v1.1/BurstGPT_without_fails_2.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

benchmarks

benchmarks

README.md

Benchmarking vLLM

Downloading the ShareGPT dataset

Downloading the ShareGPT4V dataset

Downloading the BurstGPT dataset

Files

benchmarks

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmarks

Folders and files

parent directory

README.md

Benchmarking vLLM

Downloading the ShareGPT dataset

Downloading the ShareGPT4V dataset

Downloading the BurstGPT dataset