This project is an example that uses rocBLAS to benchmark matrix multiplication across various data types.
Prerequisites
- ROCm
- rocBLAS
- rocPRIM
- rocThrust
- hipRAND
Building
To build the project, follow these steps:
- Create a build directory and change into it:
  mkdir build && cd build
- Run CMake from inside the build directory:
  cmake ..
- Compile the project:
  make
Running
To run the benchmark, execute the following command in the build directory:
./rocblas_benchmark
This runs the benchmark over the problem sizes and configurations defined in inference_server_set. For each problem size and configuration, the output reports the elapsed time in milliseconds for each data type combination (FP32, FP16-F32, FP16-F16, and INT8-INT32).
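Because the benchmark reports raw times in milliseconds, it can be useful to convert them into throughput when comparing problem sizes or data types. The following is a minimal sketch, not part of this project: `gemm_tflops` is a hypothetical helper name, and it assumes the conventional count of 2*M*N*K operations for an M x N x K matrix multiplication. For the INT8-INT32 case the result is tera-operations per second rather than floating-point operations.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of this project): converts a measured
// matrix-multiplication time into throughput. Assumes the usual count of
// 2*M*N*K operations (one multiply and one add per inner-product term).
double gemm_tflops(std::int64_t m, std::int64_t n, std::int64_t k, double time_ms) {
    assert(time_ms > 0.0 && "elapsed time must be positive");
    const double ops = 2.0 * static_cast<double>(m)
                           * static_cast<double>(n)
                           * static_cast<double>(k);
    const double seconds = time_ms * 1e-3;  // milliseconds to seconds
    return ops / seconds / 1e12;            // tera-operations per second
}
```

For example, a 1024 x 1024 x 1024 problem that completes in 1.0 ms corresponds to roughly 2.15 tera-operations per second under this assumption.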
License
This project is open-source and free to use, modify, and distribute.