phi-related IR downgrade issue #488
Labels
kernels
Things about kernels and how they are compiled.
Comments
Which versions of Metal.jl? Please share |
In any case, this does look to hit a bug in the LLVM IR downgrader:
@threadgroup_memory = external global [256 x i8]
define void @kernel() {
entry:
br label %exit
loop_entry:
%0 = phi i32* [ bitcast ([256 x i8]* @threadgroup_memory to i32*), %loop_cont1 ], [ null, %loop_cont2 ]
br label %exit
loop_cont1:
br i1 false, label %exit, label %loop_entry
loop_cont2:
br label %loop_entry
exit:
ret void
}
I'll have a look later this week.
maleadt changed the title to "phi-related IR downgrade issue" on Dec 2, 2024
NSError: Failed to materializeAll. (AGXMetalG14X, code 3)
Edit: This is using the latest |
Merged
#489 should fix this; can you verify?
Yes, working now. Thanks a lot! |
In my kernel, I get the following error.
Archive 2.zip
The code for the kernel (using KernelAbstractions.jl) is here:
https://github.com/trixi-framework/PointNeighbors.jl/pull/73/files#diff-d920170c38b8042080898ea9c427ca54e7f718b067a5a9bca914c89dedbbdeba
This is the kernel foreach_neighbor_double_buffer. The similar kernel foreach_neighbor_localmem is compiling just fine (although running very slowly). Both run on Nvidia and AMD GPUs.