Summary:
In vLLM v1, use_cudagraph was always overridden to True. I was trying
to debug some silent incorrectness and wanted to turn it off. This PR
re-enables the use_cudagraph option in vLLM v1 (and does some variable
renaming).
Also, the new test file runs the test explicitly with VLLM_USE_V1
because it turns out that the tests/compile folder doesn't use V1 yet.
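To illustrate the override described in the summary, here is a minimal toy sketch. It does not use vLLM's real classes: `ToyCompilationConfig`, `init_always_override`, and `init_respect_user` are hypothetical names, and only the `use_cudagraph` field name is taken from the summary.

```python
from dataclasses import dataclass


@dataclass
class ToyCompilationConfig:
    # Hypothetical stand-in for a compilation config, not vLLM's actual class.
    use_cudagraph: bool = True


def init_always_override(config: ToyCompilationConfig) -> ToyCompilationConfig:
    # Old behavior as described in the summary: the user's value is discarded.
    config.use_cudagraph = True
    return config


def init_respect_user(config: ToyCompilationConfig) -> ToyCompilationConfig:
    # Intended behavior: the user's value is left untouched.
    return config


before = init_always_override(ToyCompilationConfig(use_cudagraph=False))
after = init_respect_user(ToyCompilationConfig(use_cudagraph=False))
print("always override:", before.use_cudagraph)
print("respect user:", after.use_cudagraph)
```

With the unconditional override, a user who passes `use_cudagraph=False` still ends up with CUDA graphs enabled, which is what made the debugging case above impossible.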
Test Plan:
`pytest tests/compile/test_config.py -v -k "use_cudagraphs"`
Signed-off-by: Richard Zou <zou3519@gmail.com>