Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tiled matrix multiplication example demonstrating shared memory api and usage #25

Merged
merged 3 commits into from
Mar 27, 2024

Conversation

arhik
Copy link
Member

@arhik arhik commented Mar 27, 2024

  • Shared memory can be declared in kernel (type and length is essential; which is different from CUDA.jl)
  • Private memory interface can also be along these lines.

arhik added 3 commits March 26, 2024 21:36
Using shared memory is possible now. The worked out example is tiled matrix multiplication. Though shared memory  usage is clear. This is not about correctness. Just about how to interface. multiplication still needs work.
Update tiled_matmul_kernel.jl

This example demonstrate shared memory api and usage
Copy link
Member Author

@arhik arhik left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@arhik arhik merged commit 176e3dd into JuliaWGPU:main Mar 27, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant