Test Case

| Engine | Type | Command line |
| --- | --- | --- |
| DML | MHA | `cross_runner.exe --type mha_dml mha_opts --mha_type qkv --data_type fp16 --layout ncdhw --shape_input_qkv 2,4096,8,3,40` |
| DML | MHA | `cross_runner.exe --type mha_dml mha_opts --mha_type q_kv --data_type fp16 --layout ncdhw --shape_input_q 2,4096,320 --shape_input_kv 2,4096,8,2,40` |
| DML | QGEMM | `cross_runner.exe --type quant_gemm_dml quant_gemm_opts --layout nchw --data_type fp16 --quantize_data_type uint4 --shape_a 1,1,1,14336 --shape_b 1,1,4096,14336 --shape_c 1,1,1,4096 --b_transposed --b_quantized --block_size 32 --has_zeropoint` |
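
The shapes in these commands have to agree with each other, so it can help to sanity-check them before a run. The sketch below is not part of cross_runner; the helper names are hypothetical, and the dimension meanings are assumptions: the stacked KV shape is read as (batch, sequence, heads, 2, head size), the Q shape as (batch, sequence, hidden size), and `--b_transposed` is read as B being stored as N x K.

```python
def parse_shape(text):
    """Turn a comma-separated shape string such as '2,4096,320' into a tuple of ints."""
    return tuple(int(dim) for dim in text.split(","))


def check_q_kv(shape_q, shape_kv):
    """Check that Q's hidden size equals heads * head_size of the stacked KV tensor.

    Assumed layouts: Q = (batch, seq, hidden), KV = (batch, seq, heads, 2, head_size).
    """
    batch_q, _seq_q, hidden = parse_shape(shape_q)
    batch_kv, _seq_kv, heads, stack, head_size = parse_shape(shape_kv)
    return batch_q == batch_kv and stack == 2 and hidden == heads * head_size


def check_quant_gemm(shape_a, shape_b, shape_c, block_size):
    """Check M, N, K agreement for A (M x K), transposed B (N x K) and C (M x N).

    Also checks that K splits evenly into quantization blocks (an assumption
    about how --block_size is applied).
    """
    *_, m, k = parse_shape(shape_a)
    *_, n, k_b = parse_shape(shape_b)
    *_, m_c, n_c = parse_shape(shape_c)
    return k == k_b and (m, n) == (m_c, n_c) and k % block_size == 0


# Shapes taken from the q_kv and QGEMM rows of the table above.
print(check_q_kv("2,4096,320", "2,4096,8,2,40"))
print(check_quant_gemm("1,1,1,14336", "1,1,4096,14336", "1,1,1,4096", 32))
```

For the q_kv row, 8 heads of size 40 give the hidden size 320 passed in `--shape_input_q`; for the QGEMM row, K = 14336 divides into 448 blocks of 32.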

Stateless kernel support

Add `--use_stateless` to your command line, for example:

# GEMM test case
./cross_runner.exe --iters 1  --type gemm_cm gemm_opts --gemm_type ab  --data_type fp16 --layout nchw --shape_a 1,1,1024,16 --shape_b 1,1,16,1024 --b_managed gemm_cm_opts --large_grf --tile_m 1 --tile_k 16 --tile_n 16 --slice_k 1 --dump_asm --use_stateless

Copy the stateless version of the kernel into the cross_runner binary folder. Two samples exist so far, for example:

tools\cross_runner\kernels\gemm_nchw_fp16_stateless.cpp
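
The copy step can be scripted. This is a minimal sketch, not something the repository ships: `copy_kernel` is a hypothetical helper, and the binary folder in the commented usage line (`build\Release`) is only an assumed location that you should replace with wherever your `cross_runner.exe` actually lives.

```python
import shutil
from pathlib import Path


def copy_kernel(kernel_path, binary_dir):
    """Copy one kernel source file into the folder that holds cross_runner.exe.

    Returns the path of the copied file. Raises FileNotFoundError if the
    binary folder does not exist, so a wrong path is caught early.
    """
    binary_dir = Path(binary_dir)
    if not binary_dir.is_dir():
        raise FileNotFoundError(f"binary folder not found: {binary_dir}")
    # shutil.copy2 keeps the file name when the destination is a directory.
    return Path(shutil.copy2(kernel_path, binary_dir))


# Example call; the destination folder is an assumption, adjust it to your build:
# copy_kernel(r"tools\cross_runner\kernels\gemm_nchw_fp16_stateless.cpp", r"build\Release")
```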

Building

git clone
git submodule update --init --recursive
cmake

In case of CMake errors, especially those related to finding the OpenCL installation, you can try the following steps:

Install the full GPU driver package from your GPU vendor's official website. This should include the OpenCL runtimes necessary for development.
Download the OpenCL headers and loader from the Khronos Group's official GitHub repository or use the OpenCL SDK provided by your GPU vendor.
If you encounter issues with CMake finding OpenCL, you can specify the paths to the OpenCL headers and library using the following command line (adjust the paths to match your OpenCL SDK installation):

cmake . -Bbuild -G "Visual Studio 17 2022" -DOpenCL_INCLUDE_DIR="C:\path\to\OpenCL\include" -DOpenCL_LIBRARY="C:\path\to\OpenCL\lib\opencl.lib"

cd build
# the default build is the Debug version
cmake  --build .  -j

# if the Debug build does not work, try building the Release version
cmake  --build . --config Release -j
