
[RUNTIME][CLML] OpenCLML tuning and profiling enhanced #13843

Merged · 3 commits merged into apache:main from clml_tuning on Jan 30, 2023

Conversation

srkreddy1238
Contributor

srkreddy1238 commented on Jan 25, 2023

The tuning cache binary is serialized through dmlc::Stream to support multiple CLML subgraphs within a TVM module. The individual tuning cache blobs are saved to the same output file.
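
For readers unfamiliar with dmlc::Stream, here is a minimal sketch of the idea, not the code in this PR; `SaveTuningBlobs` and the on-disk layout are hypothetical:

```cpp
// Illustrative only: write several named tuning blobs into one file via dmlc::Stream.
#include <dmlc/io.h>

#include <cstdint>
#include <map>
#include <memory>
#include <string>

void SaveTuningBlobs(const std::string& path,
                     const std::map<std::string, std::string>& blobs) {
  std::unique_ptr<dmlc::Stream> strm(dmlc::Stream::Create(path.c_str(), "w"));
  uint64_t count = blobs.size();
  strm->Write(count);        // number of CLML subgraphs in this module
  for (const auto& kv : blobs) {
    strm->Write(kv.first);   // subgraph symbol name
    strm->Write(kv.second);  // raw CLML tuning cache blob for that subgraph
  }
}
```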

A new API on OpenCLWorkspace enables or disables profiling on the command queue, rather than doing so only when the Timer is invoked. This is required to perform CLML operator tuning.
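
A rough sketch of the idea using standard OpenCL calls; the actual OpenCLWorkspace method name and signature in this PR may differ:

```cpp
// Illustrative only: (re)create a command queue with profiling turned on or off.
#include <CL/cl.h>

cl_command_queue RecreateQueue(cl_context ctx, cl_device_id dev,
                               cl_command_queue old_queue, bool enable_profiling) {
  if (old_queue != nullptr) {
    clFinish(old_queue);               // drain pending work before replacing the queue
    clReleaseCommandQueue(old_queue);
  }
  cl_command_queue_properties props = enable_profiling ? CL_QUEUE_PROFILING_ENABLE : 0;
  cl_int err = CL_SUCCESS;
  cl_command_queue queue = clCreateCommandQueue(ctx, dev, props, &err);
  return (err == CL_SUCCESS) ? queue : nullptr;
}
```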

CLML layer profiling now uses the OpenCL Timer interface.
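
As a sketch of how a layer can be timed through TVM's Timer interface (tvm/runtime/profiling.h); `ProfileLayerNs` and the call site are hypothetical, the exact integration in the CLML runtime may differ:

```cpp
// Illustrative only: time one layer with TVM's backend-specific Timer.
#include <tvm/runtime/profiling.h>

#include <cstdint>
#include <functional>

int64_t ProfileLayerNs(tvm::Device dev, const std::function<void()>& run_layer) {
  tvm::runtime::Timer timer = tvm::runtime::Timer::Start(dev);  // OpenCL event timing on OpenCL devices
  run_layer();                                                  // enqueue the CLML layer
  timer->Stop();
  return timer->SyncAndGetElapsedNanos();                       // sync and read elapsed nanoseconds
}
```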

This PR also avoids offloading the pad operator as the very first layer (specifically, before at least one convolution layer), due to a layout-related limitation of the CLML pad operator. Please refer to the CLML SDK documentation for more details.

Co-authored-by: Krishna Raju Vegiraju <quic_kvegiraju@quicinc.com>

@tvm-bot
Collaborator

tvm-bot commented Jan 25, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

srkreddy1238 force-pushed the clml_tuning branch 3 times, most recently from 199755d to 4f672d5 on January 26, 2023
Contributor

echuraev left a comment


Several comments

srkreddy1238 and others added 2 commits January 27, 2023 12:24
Co-authored-by: Egor Churaev <egor.churaev@gmail.com>
Contributor

echuraev left a comment


LGTM. Thanks

echuraev merged commit 3c81d9b into apache:main on Jan 30, 2023
fzi-peccia pushed a commit to fzi-peccia/tvm that referenced this pull request Mar 27, 2023
* [RUNTIME][CLML] OpenCLML tuning and profiling enhanced

* Update src/runtime/opencl/opencl_common.h

* Review comments

Co-authored-by: Egor Churaev <egor.churaev@gmail.com>