-
Notifications
You must be signed in to change notification settings - Fork 99
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #1429 from kokkos/half-precision
Minor changes for half precision paper
- Loading branch information
Showing
3 changed files
with
31 additions
and
3 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
26 changes: 26 additions & 0 deletions
26
perf_test/blas/blas3/papers/kokkos-half-t-rse-escience-2022/reproducer.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
## To reproduce the half precision results for batched-GEMM: | ||
```bash | ||
git clone https://github.com/kokkos/kokkos.git | ||
git clone https://github.com/kokkos/kokkos-kernels.git | ||
cd kokkos-kernels | ||
git checkout tags/papers/us-rse-escience-2022 | ||
cd perf_test/blas/blas3 | ||
export KOKKOS_SRC_DIR=/path/to/kokkos | ||
export KOKKOSKERNELS_SRC_DIR=/path/to/kokkos-kernels | ||
``` | ||
|
||
### On V100 | ||
```bash | ||
./KokkosBatched_BatchedGemm_benchmark.sh double SNB VOLTA70 | ||
./KokkosBatched_BatchedGemm_benchmark.sh float SNB VOLTA70 | ||
./KokkosBatched_BatchedGemm_benchmark.sh half SNB VOLTA70 | ||
./KokkosBatched_BatchedGemm_benchmark.sh bhalf SNB VOLTA70 | ||
``` | ||
|
||
### On A100 | ||
```bash | ||
./KokkosBatched_BatchedGemm_benchmark.sh double DEFAULT AMPERE80 | ||
./KokkosBatched_BatchedGemm_benchmark.sh float DEFAULT AMPERE80 | ||
./KokkosBatched_BatchedGemm_benchmark.sh half DEFAULT AMPERE80 | ||
./KokkosBatched_BatchedGemm_benchmark.sh bhalf DEFAULT AMPERE80 | ||
``` |