docs: add tutorial on raising #1935
base: main
02ecfaa to
7f2a3d2
Compare
needs EnzymeAD/Enzyme-JAX#1668 + corresponding jll
docs/src/tutorials/raising.md
Outdated
## Raising GPU Kernels

<!-- TODO: write this section -->
I have written this section and can complete it with a tutorial: 5fe6e01
yeah for ease @Pangoraw mind just pushing that commit to this branch here?
4624977 to
5a47dd6
Compare
- Running the raised compute kernel on hardware that the original kernel was not designed for (_e.g._ running a CUDA kernel on a TPU).
- Enabling further optimizations: since the raised kernel is indistinguishable from the rest of the program, it can be optimized together with it. For example, two sequential kernel launches, where one operates on the result of the other, can be fused if both are raised, resulting in a single kernel launch in the final optimized StableHLO program.
- Lastly, automatic differentiation in Reactant is currently not supported for GPU kernels. Raising kernels enables Enzyme to differentiate the raised kernel. For this to work, one must use the `raise_first` compilation option to make sure the kernels are raised before Enzyme performs automatic differentiation on the program.
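
A minimal sketch of what compiling a GPU kernel with raising enabled might look like. The kernel `square_kernel!` and wrapper `square!` are hypothetical names chosen for illustration, and passing `raise` and `raise_first` as keyword options to `@compile` is an assumption about how the options are spelled, not something this diff confirms:

```julia
using Reactant, CUDA

# Hypothetical element-wise kernel, for illustration only.
# Each thread squares one element of `x` and stores it in `y`.
function square_kernel!(y, x)
    i = threadIdx().x
    @inbounds y[i] = x[i] * x[i]
    return nothing
end

# Host-side wrapper that launches the kernel with one thread per element.
function square!(y, x)
    @cuda blocks = 1 threads = length(x) square_kernel!(y, x)
    return nothing
end

x = Reactant.to_rarray(collect(Float32, 1:64))
y = Reactant.to_rarray(zeros(Float32, 64))

# Assumption: `raise = true` asks the compiler to raise the kernel, and
# `raise_first = true` makes raising happen before Enzyme differentiates.
square_compiled! = @compile raise = true raise_first = true square!(y, x)
square_compiled!(y, x)
```

If the options behave as described in the bullets above, the compiled function no longer depends on a CUDA launch, so the same program could in principle be run on a non-CUDA backend or differentiated by Enzyme.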
could we also add an example here as well, so folks don't get confused and think the scalar loop examples are part of GPU kernel raising?
739f1bf to
db94b19
Compare
No description provided.