-
Notifications
You must be signed in to change notification settings - Fork 221
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Failed IntelMPI E2E tests #675
Comments
cc: @alculquicondor |
/kind flake If anyone has a solution, feel free to submit a PR. |
@tenzen-y: The label(s) In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
In my investigation, it seems that even the v0.4.0 intel-pi image has the same problem. |
Are you saying that this is failing with released images too? |
Yes.
I'm not sure that the root cause is latest Intel MPI since our E2E testing does not use the latest pi image: mpi-operator/test/e2e/e2e_suite_test.go Line 52 in c738a83
Note that The building image name during E2E testing is Line 75 in c738a83
After I found this problem, I tried to fix the E2E image so that we could use the latest build image like bce83b6. But I keep facing the same errors... |
Note that this failure happens only in GitHub actions. In my local and other developer environments, E2E succeeded. |
Currently, I'm doubting the Pod-to-Pod communication compatibility between Kind and GH actions |
After I switched to MInikube in CI, I faced the same errors again: 005e72f So, the root cause seems to be different. |
Intel MPI E2E tests failed in CI:
mpi-operator/test/e2e/mpi_job_test.go
Lines 207 to 272 in c738a83
We have seen this problem in the following:
The text was updated successfully, but these errors were encountered: