Skip to content

Commit 74a44d0

Browse files
committed
fix test v1
Signed-off-by: wangli <wangli858794774@gmail.com>
1 parent 55c377a commit 74a44d0

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

tests/test_offline_inference.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -59,8 +59,10 @@ def test_models(model: str, use_v1: str, dtype: str, max_tokens: int,
5959
vllm_model.generate_greedy(example_prompts, max_tokens)
6060

6161

62+
# Now our pvc reading speed is too slow
63+
# For faster testing, temporarily uncheck the support for testing large weight models on v1
6264
@pytest.mark.multinpu
63-
@pytest.mark.parametrize("use_v1", ["1", "0"])
65+
@pytest.mark.parametrize("use_v1", ["0"])
6466
@pytest.mark.parametrize("model, distributed_executor_backend", [
6567
("Qwen/QwQ-32B", "mp"),
6668
])

0 commit comments

Comments
 (0)