Commit c406df2
Add support for MXFP8 All gather (#3435)
* add MXFP8 all gather support
* added TODO for future feature
* remove emoji from comment
* fixed ruff formatting
* fixed ruff formatting
* add mxfp8 and nvfp4 to Llama eval scripts (#3394)
Update
[ghstack-poisoned]
* flip mx inference scaling setting to RCEIL (#3428)
* Update
[ghstack-poisoned]
* Update
[ghstack-poisoned]
* Update
[ghstack-poisoned]
* add CLAUDE.local.md to gitignore (#3437)
Summary:
taking claude code for a more thorough spin, will start with local
instructions and will see what makes sense to upstream
Test Plan:
Reviewers:
Subscribers:
Tasks:
Tags:
* bump python version in tutorial ci workflow (#3439)
* [CPU] Reland qconv fp8 fusion passes (#3433)
* [Reland][PT2E][X86] Add Inductor fusion passes of float8 qconv for X86Inductor backend
* add torch version check for Qconv FP8 UTs
* fix format issue
* Skip tests for ROCm
---------
Co-authored-by: Sun, Jiayi <jiayi.sun@intel.com>
* Int8Tensor migration cleanup (#3407)
* Int8Tensor migration
Summary:
This PR creates a new Int8Tensor and updates the configs to use the new
Int8Tensor flow
Test Plan:
To ensure BC:
```
pytest test/quantization/test_quant_api.py
```
To test new Int8Tensor:
```
pytest test/quantization/quantize_/workflows/int8/test_int8_tensor.py
```
Reviewers:
Subscribers:
Tasks:
Tags:
* ruff fixes
* add init
* fix ruff again
* update
* wip
* undo update tests
* fix ruff
* fix varname
* fix typing
* add tests
* fix dtype
* fix ci
* address granularity cr
* update _choose_quant_func_and_quantize_tensor
* make block size required attribute
* made dtype required as well
* address nits
* skip per tensor weight only test for now
* [xpu][test] Port 2 test/dtypes_{floatx, bitpacking} UT files to intel XPU (#3368)
* enable test/dtypes/test_bitpacking.py on intel xpu
* enable test/dtypes/test_floatx.py
* enable test/dtypes/test_floatx.py
* fix format issue
* fix format issue
* update _DEVICES
* [xpu][test] Port 2 test/quantization/pt2e/test_{quantize_pt2e, quantize_pt2e_qat} UT files to intel XPU (#3405)
* add test/quantization/pt2e/test_quantize_pt2e.py
* add test/quantization/pt2e/test_quantize_pt2e.py
* test/quantization/pt2e/test_quantize_pt2e_qat.py
* test/quantization/pt2e/test_quantize_pt2e_qat.py
* fix format issue
* update format
* increase timeout for xpu
* [Intel GPU] Enable optim SR test (#3055)
* updated test with rebase changes
* added checks to run only on CUDA with compute capability >= 9
* updated test for H100
* added test to workflow
---------
Co-authored-by: Vasiliy Kuznetsov <vkuzo@users.noreply.github.com>
Co-authored-by: Daniel Vega-Myhre <danvm@meta.com>
Co-authored-by: Xia Weiwen <weiwen.xia@intel.com>
Co-authored-by: Sun, Jiayi <jiayi.sun@intel.com>
Co-authored-by: Jesse Cai <jessecai@meta.com>
Co-authored-by: xiangdong <40376367+zxd1997066@users.noreply.github.com>
Co-authored-by: Artur Lesniak <artur.lesniak@intel.com>
1 parent 2ae2994 commit c406df2
File tree
4 files changed, +201 -0 lines changed
- .github/workflows
- test/prototype/mx_formats
- torchao/prototype/mx_formats
[Diff bodies not preserved. Four diff tables survive as line numbers only: 1 line added at line 50; lines 1-109 added; lines 1-12 added; lines 845-923 added (201 additions in total).]
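Since the diff bodies are missing, the following is only a rough, pure-Python sketch of the two ideas the commit message names: power-of-two block scaling with a FLOOR versus RCEIL rounding choice, and an all-gather that moves a quantized tensor's data and its per-block scales as separate pieces. Everything here is an assumption made for illustration: the function names are hypothetical, the block size of 32 and the float8 e4m3 maximum of 448 are standard MX/float8 constants rather than values read from this commit, the cast to float8 is omitted, and list concatenation stands in for the real collective. It is not the code added under torchao/prototype/mx_formats.

```python
import math

BLOCK_SIZE = 32    # assumed MX block size (elements sharing one scale)
E4M3_MAX = 448.0   # largest finite float8 e4m3 value
E4M3_EMAX = 8      # exponent of the largest e4m3 power of two (2**8)


def scale_exponent(amax, mode):
    """Power-of-two scale exponent for one block (hypothetical helper)."""
    if amax == 0.0:
        return -127  # assumption: all-zero blocks get the smallest exponent
    if mode == "floor":
        # Round the block maximum down to a power of two, then shift by emax.
        exp = math.floor(math.log2(amax)) - E4M3_EMAX
    elif mode == "rceil":
        # Round the scale up so that amax / 2**exp never exceeds E4M3_MAX.
        exp = math.ceil(math.log2(amax / E4M3_MAX))
    else:
        raise ValueError(f"unknown mode: {mode}")
    return max(-127, min(127, exp))


def quantize_blocks(values, mode="rceil"):
    """Return (scaled values, per-block exponents); float8 cast omitted."""
    assert len(values) % BLOCK_SIZE == 0
    data, exps = [], []
    for start in range(0, len(values), BLOCK_SIZE):
        block = values[start:start + BLOCK_SIZE]
        exp = scale_exponent(max(abs(v) for v in block), mode)
        exps.append(exp)
        # Clamp to the representable range, as a saturating cast would.
        data.extend(max(-E4M3_MAX, min(E4M3_MAX, v / 2.0 ** exp)) for v in block)
    return data, exps


def all_gather_mx(shards):
    """Stand-in for a collective: gather data and scales separately."""
    data = [x for shard_data, _ in shards for x in shard_data]
    exps = [e for _, shard_exps in shards for e in shard_exps]
    return data, exps


def dequantize(data, exps):
    """Multiply each block by its scale to recover approximate values."""
    return [x * 2.0 ** exps[i // BLOCK_SIZE] for i, x in enumerate(data)]


# Two simulated ranks, each holding one 32-element block.
rank_shards = [quantize_blocks([1.0] * 32), quantize_blocks([500.0] * 32)]
gathered_data, gathered_exps = all_gather_mx(rank_shards)
restored = dequantize(gathered_data, gathered_exps)
```

In this sketch a block maximum of 500 gets exponent 0 under "floor", so the scaled value exceeds 448 and would saturate, while "rceil" picks exponent 1 and the scaled value 250 fits. Gathering data and scales as two flat sequences keeps each block's scale aligned with its data by position, which is the property a real sharded implementation would also need to preserve.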