-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New RAJA GPU Indexing #342
Conversation
in NESTED_INIT, POLYBENCH_2MM, and DIFFUSION3DPA
Updated this so we can get new performance results. It should be ready to go. |
@rhornung67 @artv3 Can I get a review? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Did you do all the kernels that use RAJA::kernel and RAJA::launch? I was going to work through that, but thank you for doing it @MrBurmark
I think I did all the kernels. |
Use new RAJA GPU indexing policies
This gives compile time knowledge of the block sizes in some kernels. This brings the RAJA variants closer in line with the base variants.
The base variants can still take a advantage of the knowledge that the ranges start at 0.