Closed
Labels
cuda (CUDA back-end), enhancement (New feature or request), performance (Performance related issues)
Description
After 4fdbfae, preparations are in place to switch the floating-point atomic fetch_add/fetch_sub implementations to the new SPIR-V operand. Providing a "native" implementation in the CUDA back-end would let us use the same function for NVPTX targets as well (the #if !defined(__NVPTX__) guards would have to be removed to achieve this).
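For context, here is a minimal host-side sketch of what a non-native floating-point fetch_add can look like: a compare-and-swap loop over the value's bit pattern. It is only an illustration written in standard C++ and is not the code from 4fdbfae or from the runtime. The names emulated_fetch_add, emulated_fetch_sub, load_float and self_check are hypothetical, and the assumption that the existing fallback has this shape is mine. A native implementation would replace the loop with a single hardware atomic; CUDA's atomicAdd on float is one example of such an operation.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>
#include <cstring>

static_assert(sizeof(float) == sizeof(std::uint32_t),
              "sketch assumes a 32-bit float");

// Hypothetical helper: read the float stored as raw bits in `storage`.
inline float load_float(const std::atomic<std::uint32_t>& storage) {
    const std::uint32_t bits = storage.load(std::memory_order_relaxed);
    float value;
    std::memcpy(&value, &bits, sizeof(float));
    return value;
}

// Hypothetical helper: emulate a floating-point fetch_add with a CAS loop.
// Returns the value held before the addition, like std::atomic::fetch_add.
inline float emulated_fetch_add(std::atomic<std::uint32_t>& storage,
                                float operand) {
    std::uint32_t expected = storage.load(std::memory_order_relaxed);
    for (;;) {
        float old_value;
        std::memcpy(&old_value, &expected, sizeof(float));
        const float new_value = old_value + operand;
        std::uint32_t desired;
        std::memcpy(&desired, &new_value, sizeof(float));
        // On failure, compare_exchange_weak refreshes `expected` with the
        // current bits, so the next iteration retries with fresh data.
        if (storage.compare_exchange_weak(expected, desired,
                                          std::memory_order_relaxed)) {
            return old_value;
        }
    }
}

// fetch_sub expressed through fetch_add with a negated operand.
inline float emulated_fetch_sub(std::atomic<std::uint32_t>& storage,
                                float operand) {
    return emulated_fetch_add(storage, -operand);
}

// Usage sketch: single-threaded check of the returned "old" values.
inline void self_check() {
    std::atomic<std::uint32_t> cell{0};  // all-zero bits encode 0.0f
    assert(emulated_fetch_add(cell, 1.5f) == 0.0f);
    assert(emulated_fetch_sub(cell, 0.5f) == 1.5f);
    assert(load_float(cell) == 1.0f);
}
```

The loop may retry under contention, which is the cost a native atomic avoids and presumably why this issue carries the performance label.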