Commit f3dddfb

merge origin/main into Ronald1995/async_scheduling_for_spec_decode
Signed-off-by: Ronald1995 <ronaldautomobile@163.com>
2 parents ebf0674 + d4e547b commit f3dddfb

135 files changed: +5267 -1295 lines changed

.buildkite/test-amd.yaml

Lines changed: 4 additions & 3 deletions
@@ -629,15 +629,16 @@ steps:
 
 - label: OpenAI API correctness # 22min
   timeout_in_minutes: 30
-  mirror_hardwares: [amdexperimental]
+  mirror_hardwares: [amdexperimental, amdproduction]
   agent_pool: mi325_1
   # grade: Blocking
   source_file_dependencies:
   - csrc/
   - vllm/entrypoints/openai/
   - vllm/model_executor/models/whisper.py
-  commands: # LMEval+Transcription WER check
-  - pytest -s entrypoints/openai/correctness/
+  commands: # LMEval
+  # Transcription WER check is skipped because encoder-decoder models are not supported on ROCm, see https://github.com/vllm-project/vllm/issues/27442
+  - pytest -s entrypoints/openai/correctness/ --ignore entrypoints/openai/correctness/test_transcription_api_correctness.py
 
 - label: OpenAI-Compatible Tool Use # 23 min
   timeout_in_minutes: 35

.gitignore

Lines changed: 3 additions & 0 deletions
@@ -221,3 +221,6 @@ csrc/moe/marlin_moe_wna16/kernel_*
 
 # Ignore ep_kernels_workspace folder
 ep_kernels_workspace/
+
+# Allow tracked library source folders under submodules (e.g., benchmarks/lib)
+!vllm/benchmarks/lib/
