-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Operator] Further performance enhancements for conv2D #290
Conversation
Aalanli
commented
Jun 23, 2023
- changed matmul_fp16 to support last dimension a multiple of 2 or 4
- added custom pre-transform kernel to replace auto scheduler generated version
- moved padding to within the conv2d operator to remove redundant padding
not tested
test for numerical differences
More changes: |
Hi @Aalanli, let me know (by ping me in the PR) if this PR is ready to be reviewed. Sometimes I am not sure whether a PR is ready to be reviewed even when the CI is passed. You can also add [WIP] prefix (work in progress) to the PR title explicitly if you are still working on a PR. |
Hi @yaoyaoding, thanks for the message. This PR is ready to be reviewed. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @Aalanli, I left some comments.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me. Thanks @Aalanli !
- setitem with tensor values - boolean type promotion --------- Co-authored-by: Zhumakhan <nazirzhumakhan@gmail,.com>
- setitem with tensor values - boolean type promotion --------- Co-authored-by: Zhumakhan <nazirzhumakhan@gmail,.com>
- setitem with tensor values - boolean type promotion --------- Co-authored-by: Zhumakhan <nazirzhumakhan@gmail,.com>