Skip to content

LeiWang1999/rocblas-benchmark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

rocBLAS Benchmark

This project contains an example of using rocBLAS for benchmarking matrix multiplication with various data types.

Requirements

  • ROCm
  • rocBLAS
  • rocPRIM
  • rocThrust
  • hipRAND

Building

To build the project, follow these steps:

  1. Create a build directory:

    mkdir build  
    cd build  
  2. Run CMake:

    cmake ..   
  3. Compile the project:

    make  

Running

To run the benchmark, execute the following command in the build directory: ./rocblas_benchmark

This will run the benchmark with the specified problem sizes and configurations in inference_server_set. The output will show the time in milliseconds for each problem size and configuration for different data types (FP32, FP16-F32, FP16-F16, and INT8-INT32).

License

This project is open-source and free to use, modify, and distribute.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published