Skip to content

CUDA: use mma PTX instructions for FlashAttention #19018

CUDA: use mma PTX instructions for FlashAttention

CUDA: use mma PTX instructions for FlashAttention #19018

windows-latest-cmake (llvm-arm64-opencl-adreno, -G "Ninja Multi-Config" -D CMAKE_TOOLCHAIN_FILE=c...

succeeded Feb 2, 2025 in 4m 20s