gpu_cpu_math_comparison

Comparison of few GPU and CPU mathematical routines from BLAS and LAPACK libraries.

This simple project is focused at testing BLAS and LAPACK functions in GPU (cublas,cula) and CPU (blas, lapack) realms.

Example:

Comparing DGEMM, CPU_CUDA vs CPU_BLAS, matrix size 2000x2000, Tesla K20m

 gpu-cpu-comparison.x  dgemm_cuda 2000
 .
 .
 .
  cuda_dgemm GPU vs CPU test
  Routine <GPU CUDA DGEMM> spent runtime:   0.05900000 seconds.
 aver. sum of diag:   2001.0000000000000      1,1:   2.0000000000000000      1,n:   2001.0000000000000      n,1:   2001.0000000000000      n,n   4000.0000000000000 
  Routine <CPU BLAS DGEMM> spent runtime:   7.13600000 seconds.
 aver. sum of diag:   2001.0000000000000      1,1:   2.0000000000000000      1,n:   2001.0000000000000      n,1:   2001.0000000000000      n,n   4000.0000000000000

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
cmake		cmake
src		src
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
VERSION		VERSION
setup		setup

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gpu_cpu_math_comparison

Example:

About

Releases

Packages

Languages

License

miroi/gpu_cpu_math_comparison

Folders and files

Latest commit

History

Repository files navigation

gpu_cpu_math_comparison

Example:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages