Mamba V2 Test not Asserting Failures. #21379
Conversation
Code Review
This pull request correctly fixes an issue where tests were not asserting failures by replacing torch.allclose with torch.testing.assert_close. My main feedback is on the error tolerances, which have been significantly increased to make the tests pass. In one case, the tolerance is so high (50%) that it may render the test less effective at catching meaningful regressions. I've added comments suggesting ways to handle this, such as using conditional tolerances for different data types and adding TODOs to ensure they are tracked and addressed in the future.
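For context, a minimal standalone sketch (not the vLLM test code; tensor values are made up) of why the replacement matters: torch.allclose only returns a bool, so an unasserted call can never fail a test, whereas torch.testing.assert_close raises on mismatch.

```python
import torch

a = torch.tensor([1.0, 2.0])
b = torch.tensor([1.0, 2.5])

# torch.allclose returns a bool; without an `assert`, a mismatch is silently ignored.
print(torch.allclose(a, b, atol=1e-3, rtol=1e-3))  # False, but nothing fails

# torch.testing.assert_close raises AssertionError when
# |a - b| > atol + rtol * |b| for any element, so the failure is surfaced.
try:
    torch.testing.assert_close(a, b, atol=1e-3, rtol=1e-3)
except AssertionError as exc:
    print(f"mismatch detected: {exc}")
```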
The tolerances in this test have been increased significantly. While this might be necessary to make the test pass with float16 on newer hardware, it's a very loose check, especially for float32. A 5% deviation can mask meaningful numerical issues. Could you please check if float32 can pass with a tighter tolerance? If so, it would be best to use different tolerances based on the itype.
Suggested change:

-   torch.testing.assert_close(Y[:, -1], Y_min[:, -1], atol=5e-2, rtol=5e-2)
+   if itype == torch.float16:
+       y_atol, y_rtol = 5e-2, 5e-2
+       state_atol, state_rtol = 1e-2, 5e-2
+   else:
+       # Use original, tighter tolerances for float32
+       y_atol, y_rtol = 1e-3, 1e-3
+       state_atol, state_rtol = 1e-3, 1e-3
+   torch.testing.assert_close(Y[:, -1], Y_min[:, -1], atol=y_atol, rtol=y_rtol)
+   torch.testing.assert_close(final_state[:, -1],
+                              final_state_min[:, -1].to(torch.float32),
+                              atol=state_atol,
+                              rtol=state_rtol)
addressed in #21379
The new tolerance values atol=5e-1 and rtol=5e-1 are extremely high, allowing for a 50% relative difference, which may mask significant numerical discrepancies or bugs in the kernel implementation. Add a TODO comment in the code explaining why the tolerance is so high and that it needs to be investigated and tightened.
# TODO: Investigate and tighten tolerance. Current values are masking potential numerical issues.
torch.testing.assert_close(Y_eg, Y_min_eg, atol=5e-1, rtol=5e-1)
addressed in #21379
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default; only a small subset of checks runs automatically. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀
cc @tdoublep
Some of the high tolerances need to be investigated as a follow-up, but having asserts with a high tolerance is better than having no asserts at all.
Essential Elements of an Effective PR Description Checklist
(Optional) Documentation update, such as updating supported_models.md and examples for a new model.

Purpose
The test_mamba_ssm_ssd test introduced in #10909 is currently problematic: it does not assert on failures, so the checks were never actually enforced and some were in fact failing. The tolerance in test_mamba_chunk_scan_cont_batch is now set quite high to make it pass, but the plan is to revisit this later.

cc: @tdoublep @tlrmchlsmth @cyang49
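As a hedged illustration of the antipattern being fixed (placeholder names, not the actual test code):

```python
import torch

def buggy_check(Y: torch.Tensor, Y_min: torch.Tensor) -> None:
    # Antipattern: the boolean result is discarded, so this line can
    # never fail the test even when the two outputs diverge.
    torch.allclose(Y, Y_min, atol=1e-3, rtol=1e-3)

def fixed_check(Y: torch.Tensor, Y_min: torch.Tensor) -> None:
    # Fix: assert_close raises AssertionError whenever
    # |Y - Y_min| > atol + rtol * |Y_min| for some element.
    torch.testing.assert_close(Y, Y_min, atol=1e-3, rtol=1e-3)
```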
Test Plan
Test Result
(Optional) Documentation Update