Commit a52a64a

authored

[Inductor][float8] Support qlinear for float8 in inductor (#2565)

* quantize_affine_float8/dequantize_affine_float8 not decomposed on inductor * remove redundant unittest.skipIf * fix rebase issue * change dispatch key to a flag decomposed * support scaled_mm on inductor * fix rebase issue * support dequant promtion for fp8 * add ut * remove redundant codes * fix lint * resolve conflict * change to use qlinear * add ut * fix lint * support fp8 quant_lift_up * add reshape into _VIEW_METHOD_OPS * add quant_input_check * fix lint * refine ut * remove fp8 dynamic quant ut * fix output_scale issue * add float8_e4m3fn to dtype_list * refine code * refine code * fix bugs * add comment * merge main * change to use non-decomposed q/dq * fix lint * add version check * change version * fix attention bug; update ut * add liftup oplist

1 parent 239e57a commit a52a64aCopy full SHA for a52a64a

2 files changed

+834

-250

lines changed

test/quantization/pt2e
- test_x86inductor_fusion.py
torchao/quantization/pt2e/inductor_passes
- x86.py

2 files changed

+834

-250

lines changed

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Commit a52a64a

2 files changed

2 files changed

File tree

2 files changed

2 files changed

0 commit comments