Commit b03ac1d
Pradyun Ramadorai
fix: Restore critical USE_CUTLASS_MOE environment variable support
ISSUE: The USE_CUTLASS_MOE environment variable support (CLAUDE.md entry vllm-project#14)
was lost during a previous merge, removing critical debugging/compatibility control.
ROOT CAUSE: Upstream changes overwrote the Mantle modification that added
environment variable control for CUTLASS MoE implementations.
SOLUTION: Restored the missing environment variable logic:
- Added `import os` to imports
- Restored `default_use_cutlass` calculation with original conditions
- Restored `USE_CUTLASS_MOE` environment variable with smart defaults:
* USE_CUTLASS_MOE=1 forces CUTLASS MoE on (default when conditions met)
* USE_CUTLASS_MOE=0 disables CUTLASS MoE, fallback to other implementations
- Maintains backward compatibility with automatic detection
CODE CHANGES:
- File: `vllm/model_executor/layers/quantization/compressed_tensors/compressed_tensors_moe.py`
- Lines: 5 (import), 547-556 (environment variable logic)
- Annotation: Added comprehensive Mantle modification comments for future merge guidance
TESTING: Verified import functionality and environment variable integration.
This fix enables debugging and compatibility control for CUTLASS MoE implementations
as documented in CLAUDE.md registry entry vllm-project#14.
Signed-off-by: Pradyun Ramadorai <pradyunr@amazon.com>1 parent 6a04908 commit b03ac1d
File tree
1 file changed
+9
-1
lines changed- vllm/model_executor/layers/quantization/compressed_tensors
1 file changed
+9
-1
lines changedLines changed: 9 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2 | 2 | | |
3 | 3 | | |
4 | 4 | | |
| 5 | + | |
5 | 6 | | |
6 | 7 | | |
7 | 8 | | |
| |||
544 | 545 | | |
545 | 546 | | |
546 | 547 | | |
547 | | - | |
| 548 | + | |
| 549 | + | |
| 550 | + | |
| 551 | + | |
| 552 | + | |
| 553 | + | |
| 554 | + | |
548 | 555 | | |
549 | 556 | | |
| 557 | + | |
550 | 558 | | |
551 | 559 | | |
552 | 560 | | |
| |||
0 commit comments