
Integrating fast_hadamard_transform on C++ level #17

Merged: 13 commits into HanGuo97:main on Dec 12, 2024

Conversation

@BlackSamorez (Contributor) commented on Dec 10, 2024:

This PR adds a kernel that wraps together a Fast Hadamard Transform (built from Tri Dao's implementation) and FLUTE GEMM to support HIGGS.

Note that submodule initialization is required to build it.
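(If the repository uses the standard git submodule workflow, this typically means running `git submodule update --init --recursive` before building; the exact steps may differ depending on how the submodule is configured.)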

@BlackSamorez BlackSamorez changed the title [WIP] Integrating fast_hadamard_transform on C++ level Integrating fast_hadamard_transform on C++ level Dec 10, 2024
@BlackSamorez (Contributor, Author) commented:
@HanGuo97 Hi! Works on my machine. Could you check if it compiles and works for you as well?

flute/__init__.py

@HanGuo97 (Owner) commented:
Is this file a copy of the one in the original "fast-hadamard-transform"? (It looks slightly different to me; if it is not a copy, what's the difference?)

setup.py Outdated
@@ -89,6 +89,11 @@ def get_extensions() -> List:
sources = (
list(glob.glob(os.path.join(extensions_dir, "*.cpp"))) +
list(glob.glob(os.path.join(extensions_dir, "*.cu"))))

@HanGuo97 (Owner) commented:
Minor: could we put them inside the include_dirs and sources definitions? (Looks slightly cleaner to me.)

>
at::Tensor
qgemm_hadamard(const at::Tensor& input,
const at::Tensor& weight,
@HanGuo97 (Owner) commented:
minor indentation inconsistency :)

@@ -7,6 +7,9 @@
#include "cute/numeric/integral_constant.hpp"


at::Tensor fast_hadamard_transform(at::Tensor &x, float scale);
@HanGuo97 (Owner) commented:
Could we make the signature style similar to the others (one line for the output type, and one argument per line)? This is the style used in CUTLASS.
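For illustration, the fast_hadamard_transform declaration from the diff above, rewritten in that style (return type on its own line, one argument per line), might look roughly like:

at::Tensor
fast_hadamard_transform(at::Tensor& x,
                        float scale);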

@@ -369,6 +372,45 @@ qgemm_raw_simple(const at::Tensor& input,
}


at::Tensor apply_hadamard(const at::Tensor& input, const cute::int64_t hadamard_size) {
@HanGuo97 (Owner) commented:
Could we make the signature style similar to the others (one line for the output type, and one argument per line)? This is the style used in CUTLASS.
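Similarly, the apply_hadamard definition above could be laid out as, for example:

at::Tensor
apply_hadamard(const at::Tensor& input,
               const cute::int64_t hadamard_size) {
    // ... function body unchanged ...
}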

@BlackSamorez (Contributor, Author) commented:
Upd: swapped Tri Dao's implementation for the HadaCore implementation because the former wouldn't work with torch.compile for some batch sizes. It's ~20% slower, though. TODO: choose the better option later.

@HanGuo97 merged commit 284771d into HanGuo97:main on Dec 12, 2024