spurious hydra test failures in travis #1169
This just failed in travis:
Nothing much in the logs here either, although the config.log did contain a write error.
Add more debug output to the hydra tests to attempt to capture data useful for resolving flux-framework#1169.
I hit the PMI_RANK test failure on my desktop and noted the output file was empty. This makes me wonder if Two
If buffered I/O is used, redirecting to a file might result in an The default launcher is I should mention I am hitting this on Ubuntu 18.04.1 LTS with mpich 3.3~a2-4.
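Since the symptom reported here is an empty output file, one way to gauge how often it happens is to repeat the redirected command and count the runs that capture nothing. The following is only a sketch: `count_empty_runs` and its `cer_` variables are hypothetical names, not part of the test suite, and the command under test is passed in as arguments.

```shell
# Hypothetical helper: run a command N times with stdout redirected to a
# scratch file, and print how many runs left that file empty.
count_empty_runs() {
    cer_iterations=$1
    shift
    cer_out=$(mktemp) || return 1
    cer_empty=0
    cer_i=0
    while [ "$cer_i" -lt "$cer_iterations" ]; do
        # Ignore the exit status; only the captured output matters here.
        "$@" > "$cer_out" || :
        # -s is true when the file exists and is larger than zero bytes.
        if [ ! -s "$cer_out" ]; then
            cer_empty=$((cer_empty + 1))
        fi
        cer_i=$((cer_i + 1))
    done
    rm -f "$cer_out"
    echo "$cer_empty"
}
```

For example, `count_empty_runs 100 mpiexec.hydra -n 4 printenv PMI_RANK` would report how many of 100 runs lost all output, assuming `mpiexec.hydra` is on the `PATH`.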
Problem: occasionally mpiexec output from spawned tasks is lost, causing tests to fail sporadically. Try adding the "-launcher fork" option. This overrides the default launcher, which is "ssh". Maybe this will fix flux-framework#1169
Problem: occasionally mpiexec output from spawned tasks is lost, causing tests to fail sporadically. Instead of redirecting stdout, use the mpiexec -outfile option to let mpiexec redirect the output internally. Maybe this will fix flux-framework#1169
I've seen that failure again even with the proposed mpiexec options, so I'll drop those suggested fixes from my PR.
I am seeing this failure on Running on Ubuntu 19.10 with mpich 3.3-3. EDIT: Occasionally seeing an error on
We should just nix these tests that most commonly fail:
diff --git a/t/t2004-hydra.t b/t/t2004-hydra.t
index 276bfa5e9..77eacb892 100755
--- a/t/t2004-hydra.t
+++ b/t/t2004-hydra.t
@@ -17,26 +17,6 @@ test_expect_success 'Hydra runs hello world' '
mpiexec.hydra -n 4 echo "Hello World"
'
-count_uniq_lines() { sort $1 | uniq | wc -l; }
-
-test_expect_success 'Hydra sets PMI_FD to unique value' '
- mpiexec.hydra -n 4 printenv PMI_FD > out &&
- test_debug "cat out" &&
- test $(count_uniq_lines out) -eq 4
-'
-
-test_expect_success 'Hydra sets PMI_RANK to unique value' '
- mpiexec.hydra -n 4 printenv PMI_RANK > out2 &&
- test_debug "cat out2" &&
- test $(count_uniq_lines out2) -eq 4
-'
-
-test_expect_success 'Hydra sets PMI_SIZE to uniform value' '
- mpiexec.hydra -n 4 printenv PMI_SIZE > out3 &&
- test_debug "cat out3" &&
- test $(count_uniq_lines out3) -eq 1
-'
-
test_expect_success 'Flux libpmi-client wire protocol works with Hydra' '
mpiexec.hydra -n 4 ${PMI_INFO}
'
They are not accomplishing much for us.
Problem: hydra tests fail occasionally As noted in flux-framework#1169, some versions of hydra might have a problem capturing stdio, which makes these tests unreliable. Drop the tests that are just verifying hydra's PMI behavior, and make the ones that remain not dependent on stdio. Fixes flux-framework#1169
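One way a check could avoid depending on captured stdio is to have each task validate its own environment and report through its exit status. This is a sketch of that idea, not the change that was actually merged: `check_pmi_env` is a hypothetical helper, and it assumes the launcher exits non-zero when any task fails.

```shell
# Hypothetical per-task check: succeeds only if PMI_RANK and PMI_SIZE
# are set and consistent with the expected job size given as $1.
check_pmi_env() {
    cpe_expected_size=$1
    # PMI_RANK must be set and non-empty.
    test -n "${PMI_RANK:-}" || return 1
    # PMI_SIZE must match the expected number of tasks.
    test "${PMI_SIZE:-}" = "$cpe_expected_size" || return 1
    # PMI_RANK must be an integer in the range [0, size).
    test "$PMI_RANK" -ge 0 2>/dev/null || return 1
    test "$PMI_RANK" -lt "$cpe_expected_size" 2>/dev/null
}

# Possible use under hydra (not executed here; an inline equivalent is
# used because a shell function is not visible to the spawned tasks):
#   mpiexec.hydra -n 4 sh -c 'test -n "$PMI_RANK" && test "$PMI_SIZE" -eq 4'
```

Nothing is written to or read from a file, so lost stdout cannot cause a false failure; the trade-off is that uniqueness of `PMI_RANK` across tasks is no longer verified.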
Saw the failure above on travis test on master
Build log
I didn't see any clues in the log.