-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[OpenCLML] More ops and network coverage #12762
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @srkreddy1238, thanks for the PR. Can you improve the commit message with the actual operators are being added?
@srkreddy1238 You can look at https://github.com/apache/tvm/blob/main/docs/contribute/pull_request.rst#commit-message-guideline for some guidelines on how to do that. HTH. Cheers. |
Added operators pooling (avg, max), binary operators (add, subtract, multiply, min, max) and concat. Clip operator with min=0 and max=6 is remapped to relu6 to take advantage of CLML acceleration without sub graphing this to fallback path. Added new test cases for above listed operators and also end-to-end network test cases for Resnet50 & InceptionV3. CLML support FP16 arithmetic mode which gives significant performance boost over FP32. This PR enhances FP16 usage based on Operator datatype in relay graph. Co-authored-by: Krishna Raju quic_kvegiraj@quicinc.com Co-authored-by: Shwetank Singh quic_shwesing@quicinc.com
@leandron pls take a look. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks @srkreddy1238 @gromero!
Added operators pooling (avg, max), binary operators (add, subtract, multiply, min, max) and concat. Clip operator with min=0 and max=6 is remapped to relu6 to take advantage of CLML acceleration without sub graphing this to fallback path. Added new test cases for above listed operators and also end-to-end network test cases for Resnet50 & InceptionV3. CLML support FP16 arithmetic mode which gives significant performance boost over FP32. This PR enhances FP16 usage based on Operator datatype in relay graph. Co-authored-by: Krishna Raju quic_kvegiraj@quicinc.com Co-authored-by: Shwetank Singh quic_shwesing@quicinc.com
Added operators pooling (avg, max), binary operators (add, subtract, multiply, min, max) and concat.
Clip operator with min=0 and max=6 is remapped to relu6 to take advantage of CLML acceleration
without sub graphing this to fallback path.
Added new test cases for above listed operators and also end-to-end network test cases for Resnet50
& InceptionV3.
CLML support FP16 arithmetic mode which gives significant performance boost over FP32. This PR
enhances FP16 usage based on Operator datatype in relay graph.
Co-authored-by: Krishna Raju quic_kvegiraj@quicinc.com
Co-authored-by: Shwetank Singh quic_shwesing@quicinc.com