Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fail fast on setup failure #1956

Closed
wants to merge 1 commit into from

Conversation

ironcladlou
Copy link
Contributor

If cluster setup fails, touch the shared exit signal file immediately. Otherwise
the pod won't fail for hours.

If cluster setup fails, touch the shared exit signal file immediately. Otherwise
the pod won't fail for hours.
@openshift-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: ironcladlou
To fully approve this pull request, please assign additional approvers.
We suggest the following additional approver: yifan-gu

If they are not already assigned, you can assign the PR to them by writing /assign @yifan-gu in a comment when ready.

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot openshift-ci-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Oct 17, 2018
@ironcladlou
Copy link
Contributor Author

/cc @smarterclayton @wking

@@ -241,6 +241,9 @@ objects:
#!/bin/sh
export _CI_ONLY_STAY_AWAY_OPENSHIFT_INSTALL_AWS_USER_TAGS="{\"expirationDate\": \"$(date -d '4 hours' --iso=minutes --utc)\"}"
exec /bin/openshift-install --dir=/tmp/artifacts/installer --log-level=debug cluster
if [ $? -ne 0 ]; then
touch /tmp/shared/exit
fi
Copy link
Member

@wking wking Oct 17, 2018

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We still want this shell script to exit non-zero on failures. How about we restore the line I removed here when trying to drop the shell from the setup container (which we backed away from in #1761). But with [ instead of [[ because we're no longer Bash (or change the shebang to make this script Bash again as well).

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

While we're at it, we probably want to restore the TERM handler too.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That sounds more robust... I'm out of time today, so I'll close this PR and either re-open with your fix or leave it close in favor of a replacement from you- whichever comes first. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants