Commit 569b012
Low-bit kernels fix and implementation (#704)
* [MXFP4] Dequantize FP4 kernel example, MX scale todo
* [BugFix] Fix the bug of fp4&fp16 exponential bias
* [MXFP4] Add group scale factor for BF16xMXFP4 gemm
* [Lint]
* [Test] Add test script for BF16xMXFP4 gemm
* [Lint]
* [BugFix] Fix the shape of scale tensor
* Update example_dequant_gemm_fp4_hopper.py
---------
Co-authored-by: LeiWang1999 <leiwang1999@outlook.com>
Co-authored-by: Lei Wang <34334180+LeiWang1999@users.noreply.github.com>1 parent 376ba9e commit 569b012
File tree
3 files changed
+445
-10
lines changed- examples/dequantize_gemm
3 files changed
+445
-10
lines changedLines changed: 14 additions & 10 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | | - | |
16 | | - | |
| 15 | + | |
| 16 | + | |
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
20 | | - | |
21 | | - | |
22 | | - | |
23 | | - | |
24 | | - | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
25 | 27 | | |
26 | 28 | | |
27 | 29 | | |
| |||
39 | 41 | | |
40 | 42 | | |
41 | 43 | | |
42 | | - | |
43 | | - | |
44 | | - | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
45 | 49 | | |
46 | 50 | | |
47 | 51 | | |
| |||
0 commit comments