-
Notifications
You must be signed in to change notification settings - Fork 185
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update to ROCm 5.4.2 and TBB 2021.8.0 #8273
Update to ROCm 5.4.2 and TBB 2021.8.0 #8273
Conversation
please test |
A new Pull Request was created by @fwyzard (Andrea Bocci) for branch IB/CMSSW_13_0_X/master. @smuzaffar, @aandvalenzuela, @iarspider can you please review it and eventually sign? Thanks. |
-1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-a60203/30243/summary.html External BuildI found compilation error when building: + '[' -d /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/pkgconfig ']' + rm -f '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/*.la' /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhipfort-amdgcn.a /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhipfort-nvptx.a /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhsakmt.a rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhipfort-amdgcn.a': Read-only file system rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhipfort-nvptx.a': Read-only file system rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhsakmt.a': Read-only file system error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.7NvLWG (%install) RPM build errors: line 35: It's not recommended to have unversioned Obsoletes: Obsoletes: external+rocm+5.4.2-689d2a6a8858d80b8fa33b31f7723664 Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.7NvLWG (%install) |
Add support for - AMD Instinct MI50/MI60 (gfx906) - AMD Instinct MI100/MI210/MI250 (gfx908)
Pull request #8273 was updated. |
please test |
-1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-a60203/30246/summary.html External BuildI found compilation error when building: + '[' -d /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/pkgconfig ']' + rm -f '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/*.la' /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhipfort-amdgcn.a /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhipfort-nvptx.a /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhsakmt.a rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhipfort-amdgcn.a': Read-only file system rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhipfort-nvptx.a': Read-only file system rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/689d2a6a8858d80b8fa33b31f7723664/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-689d2a6a8858d80b8fa33b31f7723664/lib/libhsakmt.a': Read-only file system error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.TZqyUN (%install) RPM build errors: line 35: It's not recommended to have unversioned Obsoletes: Obsoletes: external+rocm+5.4.2-689d2a6a8858d80b8fa33b31f7723664 Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.TZqyUN (%install) |
please test |
Pull request #8273 was updated. |
-1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-a60203/30256/summary.html External BuildI found compilation error when building: + '[' -d /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/7481b1c5a78445d17d8889da9095c769/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-7481b1c5a78445d17d8889da9095c769/lib/pkgconfig ']' + rm -f '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/7481b1c5a78445d17d8889da9095c769/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-7481b1c5a78445d17d8889da9095c769/lib/*.la' /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/7481b1c5a78445d17d8889da9095c769/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-7481b1c5a78445d17d8889da9095c769/lib/libhipfort-amdgcn.a /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/7481b1c5a78445d17d8889da9095c769/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-7481b1c5a78445d17d8889da9095c769/lib/libhipfort-nvptx.a /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/7481b1c5a78445d17d8889da9095c769/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-7481b1c5a78445d17d8889da9095c769/lib/libhsakmt.a rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/7481b1c5a78445d17d8889da9095c769/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-7481b1c5a78445d17d8889da9095c769/lib/libhipfort-amdgcn.a': Read-only file system rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/7481b1c5a78445d17d8889da9095c769/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-7481b1c5a78445d17d8889da9095c769/lib/libhipfort-nvptx.a': Read-only file system rm: cannot remove '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/7481b1c5a78445d17d8889da9095c769/opt/cmssw/el8_amd64_gcc11/external/rocm/5.4.2-7481b1c5a78445d17d8889da9095c769/lib/libhsakmt.a': Read-only file system error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.u4v7Yw (%install) RPM build errors: line 35: It's not recommended to have unversioned Obsoletes: Obsoletes: external+rocm+5.4.2-7481b1c5a78445d17d8889da9095c769 Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.u4v7Yw (%install) |
d343cd8
to
d4ec6e6
Compare
@fwyzard , |
would it be OK to create the temporary files and leave them there, until they are deleted by hand ? |
@smuzaffar , if we add |
I noticed that setting TMPDIR to Create temp files under |
ah ok, thanks. I am testing it now |
However, I do not know if that may create problems during concurrent builds ( |
I think this might be a better solution - maybe even use |
By the way, can we make any further changes in a separate PR ? |
hopefully current changes should not break any thing in cmssw as nothing depends on rocm in cmssw . Although we can make change here but that means restarting the PR tests so I would suggest to lets get this in once current tests are done (pr tests are already building cmssw now). The change for |
@fwyzard , do we have rocm distributions for SLC7, RH9? |
We have them for RHEL7 under We have them for RHEL9 under |
@smuzaffar if on a CMSSW PR I ask the bot to |
looks pretty much unrelated ? |
@smuzaffar can we merge this, or should I try to rerun the tests ? |
-1 Failed Tests: UnitTests Unit TestsI found errors in the following unit tests: ---> test testTriggerMonitors had ERRORS Comparison SummarySummary:
|
+externals |
This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_13_0_X/master IBs (but tests are reportedly failing). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @rappoccio (and backports should be raised in the release meeting by the corresponding L2) |
Update ROCm to version 5.4.2.
Enable support for newer AMD GPUs:
Update TBB to version 2021.8.0.