Commit

[Kernel] Split Marlin MoE kernels into multiple files (vllm-project#8661)

Co-authored-by: mgoin <michael@neuralmagic.com>
2 people authored and liuyanyi committed Oct 6, 2024
1 parent 68121b5 commit 9d0ec06
Showing 7 changed files with 1,552 additions and 1,427 deletions.
5 changes: 5 additions & 0 deletions CMakeLists.txt
@@ -316,6 +316,11 @@ set(VLLM_MOE_EXT_SRC

 if(VLLM_GPU_LANG STREQUAL "CUDA")
   list(APPEND VLLM_MOE_EXT_SRC
+       "csrc/moe/marlin_kernels/marlin_moe_kernel.h"
+       "csrc/moe/marlin_kernels/marlin_moe_kernel_ku4b8.h"
+       "csrc/moe/marlin_kernels/marlin_moe_kernel_ku4b8.cu"
+       "csrc/moe/marlin_kernels/marlin_moe_kernel_ku8b128.h"
+       "csrc/moe/marlin_kernels/marlin_moe_kernel_ku8b128.cu"
        "csrc/moe/marlin_moe_ops.cu")
 endif()
