Add RUN_SERIAL test property to many ATDM Trilinos tests on CUDA builds to address timeouts #7112
Comments
NOTE: Using the CTest resource allocation approach does not help on 'ride' for some reason so we are still getting quite a few timeouts there. Therefore, I will go ahead and implement this option and apply |
TriBITSPub#173, trilinos/Trilinos#7112) Turns out that TRIBITS_ADD_TEST() had bad logic for reading in <fullTestName>_SET_RUN_SERIAL and <fullTestName>_SET_DISABLED_AND_MSG when the test name is modified by setting TPL_ENABLE_MPI=ON or by passing in the NAME_POSTFIX or POSTFIX_AND_ARGS_<IDX> arguments. These are obvious use cases that I missed, and they are needed for trilinos/Trilinos#7112 and in general. This means that <fullTestName>_SET_DISABLED_AND_MSG was not implemented correctly for TRIBITS_ADD_TEST() as part of TriBITSPub#173. Turns out the driving use case for that was TRIBITS_ADD_ADVANCED_TEST(), so we never noticed the bug. The next commit will fix this (which I already prototyped).
…trilinos#7112) This makes the testing workflow with TriBITS so much easier.
…nos/Trilinos#7112)
Build/Test Cases Summary
Enabled Packages: Enabled all Packages
0) MPI_DEBUG => passed: passed=381,notpassed=0 (2.15 min)
1) SERIAL_RELEASE => passed: passed=381,notpassed=0 (1.91 min)
2) MPI_DEBUG_CMake-3.17.0 => passed: passed=386,notpassed=0 (2.23 min)
3) SERIAL_RELEASE_CMake-3.17.0 => passed: passed=386,notpassed=0 (2.18 min)
Other local commits for this build/test group: 1b00a3c, 32e3fbc, 92c8ac0, a648f77, c42eb66, ef76b3c, afb7f2e
Origin repo remote tracking branch: 'github/master'
Origin repo remote repo URL: 'github = git@github.com:TriBITSPub/TriBITS.git'
At commit:
commit cee1980d053ec2c26301d2389c8b0a677fa262fe
Author: Roscoe A. Bartlett <rabartl@sandia.gov>
Date: Sat Sep 19 07:40:49 2020 -0600
Summary: Merge remote-tracking branch 'rab-github/tril-7112-run-serial' (#7112)
This represents the changes in the TriBITS PRs:
* TriBITSPub/TriBITS#327 : Fix install for MacOSX (#7881)
* TriBITSPub/TriBITS#328 : Add support for <fullTestName>_SET_RUN_SERIAL=[ON|OFF] (#7112)
…nos/Trilinos#7112) Here, I make some of the asserts for the RUN_SERIAL tests hopefully a little clearer (trilinos/Trilinos#7112). As part of this, I also reorganized the unit tests somewhat to group them better. I broke off the RUN_SERIAL tests for TAAT() into their own function. However, more needs to be done to improve the organization of these tests and make them more independent. (What we really need is a proper unit test harness for CMake code, but that will take some work to create.)
…_RK_ARS_233_MPI_1 RUN_SERIAL (trilinos#7112, trilinos#8063)
…inos#7112) This is an attempt to address review comments in PR trilinos/Trilinos#8063.
Build/Test Cases Summary
Enabled Packages: Enabled all Packages
0) MPI_DEBUG => passed: passed=382,notpassed=0 (2.05 min)
1) SERIAL_RELEASE => passed: passed=382,notpassed=0 (1.99 min)
2) MPI_DEBUG_CMake-3.17.0 => passed: passed=387,notpassed=0 (2.63 min)
3) SERIAL_RELEASE_CMake-3.17.0 => passed: passed=387,notpassed=0 (2.26 min)
Other local commits for this build/test group: 2850245
@brian-kelley, do you mind if we remove the hard-coded Note, we should leave |
@bartlettroscoe Yes, that's fine. I think adding RUN_SERIAL to non-performance tests was a temporary hack for RIDE.
…s:develop' (4557faa).
* trilinos-develop:
  Piro: fixing issue where Trapezoidal Rule Solver was not using the (trilinos#8090)
  Tpetra: fix warnings
  Tests: Fixing geminga test
  Phalanx: fix for gcc 5/6 lambda bug on cuda
  Automatic snapshot commit from tribits at 1ed3811
  ATDM: ride: Tempus_IMEX_RK_Partitioned_Staggered_FSA_Partitioned_IMEX_RK_ARS_233_MPI_1 RUN_SERIAL (trilinos#7112, trilinos#8063)
  Fixed name
  Make case for output names consistent
  Automatic snapshot commit from tribits at 362e27d
  MueLu: free fine comp operator as soon as possible
  Automatic snapshot commit from tribits at cee1980
  Allow pointing to a different TriBITS in ctest -S driver for Trilinos (trilinos#7112)
  ATDM: ride: Add RUN_SERIAL for several timing out tests (trilinos#7112)
  Zoltan2: Refactor directory to use Teuchos comm
This issue has had no activity for 365 days and is marked for closure. It will be closed after an additional 30 days of inactivity.
This issue was closed due to inactivity for 395 days.
I need to add the option <fullTestName>_SET_RUN_SERIAL=[ON|OFF] to TRIBITS_ADD_[ADVANCED_]TEST() to set the CTest RUN_SERIAL test property, and then use it to mark many of the CUDA tests in Trilinos GitHub issues #7090, #6805, #6804, #6801.

But first, we need to finish up and merge #6840 to get the work spread out over the GPUs and see if that resolves some of these timeouts.
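
For illustration only, here is a minimal CMake sketch of what the proposed option would amount to. The first part uses plain CTest (add_test() and set_tests_properties() with the RUN_SERIAL property, both standard CMake). The second part shows how the per-test cache variable named in this issue might be set from a configuration fragment. The test name MyPkg_some_cuda_test is hypothetical, and the exact way TriBITS reads the cache variable is an assumption based on the description above, not a confirmed implementation.

```cmake
cmake_minimum_required(VERSION 3.17)
project(RunSerialSketch NONE)
enable_testing()

# Hypothetical test used only for this sketch.
add_test(NAME MyPkg_some_cuda_test
  COMMAND ${CMAKE_COMMAND} -E echo "running")

# Plain CTest: with RUN_SERIAL set, ctest will not run any other test
# concurrently with this one, even under 'ctest -j<N>'.
set_tests_properties(MyPkg_some_cuda_test PROPERTIES RUN_SERIAL ON)

# Assumed usage of the option proposed in this issue: a cache variable
# named <fullTestName>_SET_RUN_SERIAL, set before the package's tests are
# defined (for example in a machine-specific configuration fragment).
# The full test name below is taken from a commit message in this thread.
set(Tempus_IMEX_RK_Partitioned_Staggered_FSA_Partitioned_IMEX_RK_ARS_233_MPI_1_SET_RUN_SERIAL
  ON CACHE BOOL "Run this test serially to avoid timeouts on 'ride'")
```

The same cache variable could presumably also be passed on the configure line as -D<fullTestName>_SET_RUN_SERIAL=ON, which would avoid touching the package's own CMakeLists.txt files.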