cutlass update #1

denghuilu · 2020-06-25T09:06:10Z

No description provided.

CUTLASS 2.1 contributes: - BLAS-style host-side API added to CUTLASS Library - Planar Complex GEMM kernels targeting Volta and Turing Tensor Cores - Minor enhancements and bug fixes

#82) #70 only updates the documentation. This commit reflects this bump in python version to the CMake configuration as well.

Adds support for NVIDIA Ampere Architecture features. CUDA 11 Toolkit recommended.

…100) - Updated mma_sm80.h to avoid perf penalty due to reinterpret_cast<>. - Enhancement to CUTLASS Utility Library's HostTensorPlanarComplex template to support copy-in and copy-out - Added test_examples target to build and test all CUTLASS examples - Minor edits to documentation to point to GTC 2020 webinar

* Updated documentation of fused GEMM example and removed UNITY BUILD batch size. The default batch size when unity build is enabled tends to be favorable.

kerrmudgeon and others added 5 commits April 7, 2020 13:51

CUTLASS 2.1 (#83)

96dab34

CUTLASS 2.1 contributes: - BLAS-style host-side API added to CUTLASS Library - Planar Complex GEMM kernels targeting Volta and Turing Tensor Cores - Minor enhancements and bug fixes

update tools/library/CMakeLists to require python 3.6 according to #70 (

e33d90b

#82) #70 only updates the documentation. This commit reflects this bump in python version to the CMake configuration as well.

CUTLASS 2.2 (#96)

86931fe

Adds support for NVIDIA Ampere Architecture features. CUDA 11 Toolkit recommended.

Added examples to enable the unity build (#102)

fd7e058

* Updated documentation of fused GEMM example and removed UNITY BUILD batch size. The default batch size when unity build is enabled tends to be favorable.

denghuilu merged commit 0a6b59b into denghuilu:master Jun 25, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cutlass update #1

cutlass update #1

denghuilu commented Jun 25, 2020

cutlass update #1

cutlass update #1

Conversation

denghuilu commented Jun 25, 2020