-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Refactor] Remove dead code from depthwise_conv2d for Intel graphics #8381
[Refactor] Remove dead code from depthwise_conv2d for Intel graphics #8381
Conversation
cbd8d5d
to
3a632b7
Compare
After fix a66186b, I saw that it should be necessary to do the same fix for depthwise_conv2d for intel graphics. I saw that we never used the removed code and it is just the same code from cuda/depthwise_conv2d.py. So we can use the cuda implementation when it will be necessary.
3a632b7
to
d84a9d8
Compare
@echuraev BTW is |
@elvin-n did some experiments with intel graphics. Maybe he can give more accurate information about intel grahics. |
@masahi
Several more questions to you
|
Interesting! Since our
The result was compared against GTX 1070 ti, with peak 8 TFLOPS. End to end result:
DETR
I consider this results excellent for Gen11, given HW peak difference and the fact that auto scheduler was hyper optimized for CUDA + dGPU.
|
The problem is that scheduling for intel_graphics convolutions use(d) "warp" scope while it is not defined neither for OpenCL nor for Metal. And I do not know how it could work previously. I have not seen that |
…pache#8381) After fix a66186b, I saw that it should be necessary to do the same fix for depthwise_conv2d for intel graphics. I saw that we never used the removed code and it is just the same code from cuda/depthwise_conv2d.py. So we can use the cuda implementation when it will be necessary.
…pache#8381) After fix a66186b, I saw that it should be necessary to do the same fix for depthwise_conv2d for intel graphics. I saw that we never used the removed code and it is just the same code from cuda/depthwise_conv2d.py. So we can use the cuda implementation when it will be necessary.
After fix a66186b, I saw that it should be necessary to do the same fix
for depthwise_conv2d for intel graphics. I saw that we never used the
removed code and it is just the same code from
cuda/depthwise_conv2d.py. So we can use the cuda implementation when it
will be necessary.
Thanks for contributing to TVM! Please refer to guideline https://tvm.apache.org/docs/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from Reviewers by @ them in the pull request thread.