[ROCm] Use tl.range() in block GEMM kernels with num_stages set by host. #546