Fix the incorrect MKLDNN/MKL logic in cmake #14877
update MKL_ROOT
@mxnet-label-bot add [pr-awaiting-review]
ci/jenkins/Jenkins_steps.groovy
Outdated
ws('workspace/build-cpu-nomkldnn-mkl') {
  timeout(time: max_time, unit: 'MINUTES') {
    utils.init_git_win()
    powershell 'py -3 ci/build_windows.py -f WIN_CPU_NOMKLDNN_MKL'
The build flavor (-f) should be WIN_CPU_MKL instead of WIN_CPU_NOMKLDNN_MKL, since it was renamed from the latter in ci/build_windows.py (line 93).
Thank you for the fix; I'll change it now.
ci/build_windows.py
Outdated
@@ -218,6 +246,8 @@ def main():
        os.environ["OpenCV_DIR"] = "C:\\Program Files\\OpenCV-v3.4.1\\build"
    if 'CUDA_PATH' not in os.environ:
        os.environ["CUDA_PATH"] = "C:\\Program Files\\NVIDIA GPU Computing Toolkit\\CUDA\\v9.2"
    if 'MKL_ROOT' not in os.environ:
        os.environ["MKL_ROOT"] = "C:\Program Files (x86)\IntelSWTools\compilers_and_libraries\windows\mkl"
I'm not sure what effect the escape characters have here, but for a path string I think a raw string (r'the/path/string') is the better choice in this case.
We tested both formats and both work. Anyway, I'll change it now. Thanks.
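To illustrate the point about escape characters, here is a minimal Python sketch. The short path `C:\tools\new` is a hypothetical example chosen because it contains `\t` and `\n`; it is not taken from this PR. In the MKL path above, none of the backslash pairs (`\P`, `\I`, `\c`, `\w`, `\m`) is a recognized Python escape sequence, which is presumably why both formats behaved the same in testing, although recent Python versions emit a warning for such unrecognized escapes.

```python
# Hypothetical path, chosen only because it contains "\t" and "\n".
plain = "C:\tools\new"      # "\t" becomes a tab, "\n" becomes a newline
raw = r"C:\tools\new"       # raw string: backslashes are kept literally
escaped = "C:\\tools\\new"  # doubled backslashes: same value as the raw string

# The MKL path from this PR, written as a raw string so that no
# backslash can ever be interpreted as an escape sequence.
MKL_ROOT_DEFAULT = r"C:\Program Files (x86)\IntelSWTools\compilers_and_libraries\windows\mkl"

if __name__ == "__main__":
    print(repr(plain))
    print(repr(raw))
    print(raw == escaped)
```

A raw string or doubled backslashes both keep the path intact regardless of which letters follow each backslash, so either is safer than relying on the letters not forming an escape.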
fix nomkldnn issue.
change MKL_ROOT
@Chancebair @marcoabreu Could you help install the full MKL library on the Windows CI? We assume the full MKL will be installed to C:\Program Files (x86)\IntelSWTools\compilers_and_libraries\windows\mkl on Windows. You can get the executable here: http://registrationcenter-download.intel.com/akdlm/irc_nas/tec/15247/w_mkl_2019.3.203.exe Thanks.
@yinghu5 I created a ticket with my team's on-call to look into this.
Hi @Chancebair, do you have any update on the ticket? I don't want to rush you, but I hope this PR can make the code freeze for the 1.5.0 release, as I see quite a few issues about the Windows build. It will also help improve the coverage of the existing CI system. Thank you.
MKL will be installed on Windows sometime today thanks to @lebeg.
Great! Thank you @Chancebair and @lebeg. @yinghu5, please check whether the Windows CI works now.
@Chancebair and @lebeg, thank you very much! Is MKL installed to C:\Program Files (x86)\IntelSWTools\compilers_and_libraries\windows\mkl? The current CI test shows,
windows md5
windows md5
export MKLROOT
powershell 'cd $Env:MKL_ROOT'
I've updated the scripts for AMI creation, but currently we have a blocker on @Chancebair's side. Once he deploys the new AMIs, MKL should be available at the exact path you've mentioned.
I've updated the Windows instances to the new AMI. It may take some time to cycle them with the autoscaler, but please give it a go.
trigger MKL_ROOT test
test MKL_ROOT line 552
update 538 $Env:MKL_ROOT
…to yinghu5-cmake-1
@perdasilva @marcoabreu May I have your comments on the CI status? It seems it doesn't work now.
@yinghu5 I've checked the Windows instances running on prod, and they are running with the new AMIs.
I also see that there's been a successful run since the update. Could it be that the TensorRT issue was throwing you off? The current error is on the PowerShell side.
This line probably doesn't like backslashes.
ci/jenkins/Jenkins_steps.groovy
Outdated
ws('workspace/build-cpu-mkl') {
  timeout(time: max_time, unit: 'MINUTES') {
    utils.init_git_win()
    powershell '$Env:MKL_ROOT="C:\Program Files (x86)\IntelSWTools\compilers_and_libraries\windows\mkl"'
Could you move this code into build_windows.py? We prefer to have these scripts self-contained and the Jenkinsfile just being an orchestration tool.
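One way to keep the environment setup inside the Python build script, as the comment above suggests, is sketched below. This is only an illustration under stated assumptions: the helper name `ensure_mkl_root` is hypothetical and is not part of ci/build_windows.py, and the default is the install path discussed earlier in this thread.

```python
import os

# Default install location discussed in this PR (raw string, so the
# backslashes are taken literally).
DEFAULT_MKL_ROOT = r"C:\Program Files (x86)\IntelSWTools\compilers_and_libraries\windows\mkl"


def ensure_mkl_root(env=None):
    """Set MKL_ROOT only if it is not already defined.

    Hypothetical helper: `env` defaults to os.environ, but any mutable
    mapping can be passed, which makes the behavior easy to test.
    Returns the effective MKL_ROOT value.
    """
    env = os.environ if env is None else env
    # setdefault leaves an existing value untouched, mirroring the
    # "if 'MKL_ROOT' not in os.environ" check in the diff above.
    return env.setdefault("MKL_ROOT", DEFAULT_MKL_ROOT)


if __name__ == "__main__":
    print(ensure_mkl_root({}))                       # falls back to the default
    print(ensure_mkl_root({"MKL_ROOT": r"D:\mkl"}))  # keeps the caller's value
```

With the default applied in the script, the Jenkins step only needs to invoke the build script, and a value exported on the CI host still takes precedence.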
@lebeg @marcoabreu @TaoLv @pengzhao-intel Thank you very much for the support! After removing the redundant MKL_ROOT test code as you suggested, the CI was triggered today, and all the Windows CPU parts (windows-cpu) that this PR aims to fix passed successfully. Thank you!
That's great, @yinghu5. Please rebase the code again; the GPU failure is fixed now.
All CI passed except for the unix-gpu TensorRT test error. Thanks.
…to yinghu5-cmake-1
@lebeg @marcoabreu @perdasilva The Windows CI builds now. Thanks for all of your help.
LGTM - maybe @lebeg could you have a look at the cmake changes?
It's a nice first PR for MXNet, thanks @yinghu5.
* fix cmake for win and mkl
* Update build_windows.py update MKL_ROOT
* Update Jenkins_steps.groovy fix nomkldnn issue.
* Update build_windows.py change MKL_ROOT
* msshadow_blas
* update mshasow
* Update DownloadMKLML.cmake windows md5
* Update DownloadMKLML.cmake windows md5
* Update Jenkins_steps.groovy export MKLROOT
* Update Jenkins_steps.groovy powershell 'cd $Env:MKL_ROOT'
* Update Jenkins_steps.groovy trigger MKL_ROOT test
* Update Jenkins_steps.groovy test MKL_ROOT line 552
* Update Jenkins_steps.groovy update 538 $Env:MKL_ROOT
* update mshadow
* revert gitmodules
* remove comments
* remove test code
* remove test powershell
This is a re-creation of #14829. I accidentally deleted my fork of incubator-mxnet.
Description
We want to decouple the MKLDNN (MKLML) and MKL dependencies, so that MKLDNN and USE_BLAS=mkl can be used completely independently of each other.