* [kernel][DS-R1][linear] use default Fp8LinearMethod/Fp8MoEMethod
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
* [kernel][DS-R1][Attention] enable Triton MLA attention
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
* enable MHA for DeepSeek; head_size must be padded to satisfy the flash attention kernel
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
* do not break the fp8 path
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
---------
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
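
Note on the head_size padding mentioned in the MHA item: the sketch below is an illustration only, not the code from this commit. It assumes the kernel wants q, k and v to share one head size, and it uses plain Python lists rather than tensors. The helper names (`attention`, `pad_rows`, `padded_attention`) and the sizes are hypothetical. It shows why zero-padding is safe: padded q/k columns add nothing to the dot products, and padded v columns only produce extra output columns that are sliced off, provided the softmax scale still uses the original head size.

```python
import math


def attention(q, k, v, scale):
    """Naive single-head softmax attention over lists of row vectors."""
    out = []
    for qi in q:
        scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        peak = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - peak) for s in scores]
        total = sum(weights)
        out.append([
            sum(w * vj[d] for w, vj in zip(weights, v)) / total
            for d in range(len(v[0]))
        ])
    return out


def pad_rows(rows, target):
    """Zero-pad each row on the right up to `target` columns."""
    return [list(row) + [0.0] * (target - len(row)) for row in rows]


def padded_attention(q, k, v, padded_size):
    """Pad q, k, v to one head size, run attention, slice the padding off."""
    qk_size, v_size = len(q[0]), len(v[0])
    # The scale comes from the ORIGINAL q/k head size, not the padded one.
    scale = 1.0 / math.sqrt(qk_size)
    out = attention(
        pad_rows(q, padded_size),
        pad_rows(k, padded_size),
        pad_rows(v, padded_size),
        scale,
    )
    # Drop the output columns that exist only because v was padded.
    return [row[:v_size] for row in out]
```

With a real tensor library the same idea is a zero-pad on the last dimension before the kernel call and a slice on the last dimension afterwards.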