- 
                Notifications
    You must be signed in to change notification settings 
- Fork 528
[CI] Add new runner and enable QwQ multinpu test #417
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
e8aa80c    to
    dd0a9c2      
    Compare
  
    1599f82    to
    ebdad10      
    Compare
  
    | tensor_parallel_size=4, | ||
| distributed_executor_backend=distributed_executor_backend, | ||
| ) as vllm_model: | ||
| vllm_model.generate_greedy(example_prompts, max_tokens) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I remember last offline discussion, we'd also want to add transformer as compare, does it works (stable works)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
will finish at next pull request
| fi | ||
| pip install /root/.cache/pta/torch_npu-2.5.1.dev20250320-cp310-cp310-manylinux_2_17_aarch64.manylinux2014_aarch64.whl | ||
| - name: Run vllm-project/vllm-ascend test | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
V1 engine should be tested as well.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we shouldn't test v1 like this. v1 doesn't fully support in some features. in addition we should mock the VLLM_USE_V1 in the related test modules if in need
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why? We only test V1 for vllm-ascend test, not vllm test.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I mean we will construct pytest fixture to decide which cases need to be tested on v1
7b0213b    to
    59578d7      
    Compare
  
    | env: | ||
| VLLM_USE_V1: 1 | ||
| VLLM_WORKER_MULTIPROC_METHOD: spawn | ||
| VLLM_USE_V1: 0 | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why removed v1 test
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add it here @pytest.mark.parametrize("use_v1", ["1", "0"])
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
5605150    to
    83df77b      
    Compare
  
    Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
Signed-off-by: wangli <wangli858794774@gmail.com>
| @@ -1,5 +1,8 @@ | |||
| [pytest] | |||
| minversion = 6.0 | |||
| markers = | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this markers is useless now, right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this markers help pytest detect what kind of cases should be run, just like pytest -m 'multinpu'
| I'll fix the nit in the follow-up PR | 
### What this PR does / why we need it? - Add a new runner to the continuous integration system and keep the original CI runner until the new runner runs stably - Add distributed test cases ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: wangli <wangli858794774@gmail.com>
### What this PR does / why we need it? - Add a new runner to the continuous integration system and keep the original CI runner until the new runner runs stably - Add distributed test cases ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? CI passed --------- Signed-off-by: wangli <wangli858794774@gmail.com>
What this PR does / why we need it?
Does this PR introduce any user-facing change?
No
How was this patch tested?
CI passed