Computes over an array of fixed length with the CPU and the GPU, executing sequential accesses to stress the cache. This benchmark should be used in conjunction with "nvprof" and an incremental load, obtained by increasing the size of the computed array section from 1/4000 of the array on Xavier (1/16000 on TX2) up to the full array (1/1).
Compile by running the compile script (currently configured for Volta/Xavier) on a Jetson board:
./compile
and then run with
./inc_bench <SPLIT_SIZE>
and
nvprof --metrics l2_read_throughput,l2_write_throughput ./inc_bench <SPLIT_SIZE>
where <SPLIT_SIZE> controls the length of the array section (a higher value means a shorter computed section).
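An incremental-load sweep can then be scripted by decreasing <SPLIT_SIZE> from 4000 (16000 on TX2) down to 1; the intermediate step values below are only an illustration:

for SPLIT in 4000 2000 1000 400 100 40 10 4 2 1; do
    nvprof --metrics l2_read_throughput,l2_write_throughput ./inc_bench $SPLIT
done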
To modify the configuration, change the variables at the start of "inc_bench.cu"; the comments explain what they do:
// Set PRINT to 1 for debug output
#define PRINT 0
// Index range of array elements shown by the debug output
#define FROM_debug 0
#define TO_debug 16
// Set exactly one of the following to 1: ZEROCOPY for zero-copy memory, UNIFIED for Unified Memory, COPY for explicit copies
#define ZEROCOPY 0
#define UNIFIED 0
#define COPY 1
// Set RESULTCHECK to 1 to verify the result with a single CPU thread. DO NOT ENABLE: results are non-deterministic
#define RESULTCHECK 0
// Set CPU to 1 to use the CPU concurrently
#define CPU 1
// Set OPENMP to 1 to use more than 1 thread for the CPU
#define OPENMP 1
unsigned int N = 2; // Base of the array size (the effective length is derived from POW below)
const int POW = 14; // Maximum is 30; anything higher and the system will use swap, making the CUDA kernels crash
const int RUNS = 10; // How many times the benchmark is run
const int SUMS = 2; // As the CPU and GPU each work on either the left or the right side of the array, this number indicates how many "side swaps" occur
const int BLOCK_SIZE_X = 32; // CUDA block size (threads per block along x)
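For orientation, the three memory modes generally correspond to the allocation paths below. This is a minimal sketch of how such compile-time switches are commonly wired up, not the actual code from "inc_bench.cu"; the helper and pointer names are hypothetical:

#include <cuda_runtime.h>
#include <cstdlib>

// Hypothetical helper: allocates an n-element buffer under the selected
// memory mode and returns the host-side and device-side pointers.
void alloc_buffers(size_t n, int **h_out, int **d_out)
{
    size_t bytes = n * sizeof(int);
#if ZEROCOPY
    // Zero-copy: pinned host memory mapped into the GPU address space
    // (assumes cudaSetDeviceFlags(cudaDeviceMapHost) was called at startup;
    // on Jetson boards the CPU and GPU share the same physical DRAM).
    cudaHostAlloc((void **)h_out, bytes, cudaHostAllocMapped);
    cudaHostGetDevicePointer((void **)d_out, *h_out, 0);
#elif UNIFIED
    // Unified Memory: a single pointer valid on both the CPU and the GPU.
    cudaMallocManaged((void **)h_out, bytes);
    *d_out = *h_out;
#else
    // COPY: distinct host and device buffers, moved explicitly with cudaMemcpy.
    *h_out = (int *)malloc(bytes);
    cudaMalloc((void **)d_out, bytes);
#endif
}

Likewise, when CPU and OPENMP are both set to 1, the CPU side presumably walks its section of the array with an OpenMP-parallel loop along these lines (the function name and bounds are again hypothetical):

// Hypothetical CPU-side worker: sequentially touches elements [from, to),
// split across threads when OPENMP is enabled.
void cpu_work(int *data, long long from, long long to)
{
#if OPENMP
    #pragma omp parallel for
#endif
    for (long long i = from; i < to; ++i)
        data[i] += 1; // sequential accesses stress the shared cache
}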