Skip to content

Conversation

@ankedia
Copy link
Member

@ankedia ankedia commented Feb 17, 2021

This PR evaluates the job condition in addition to job status to determine if the job has failed. In "etcd restore" test, when job remains in the Pending state for more than ActiveDeadlineSeconds, the job condition is changed to "failed" but the job status remains null. This causes the fiber to get stuck in the WaitForJobReady step.

Copy link
Member

@alai8 alai8 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good. Don't forget to add test case to JobWatcherTest.

@ankedia
Copy link
Member Author

ankedia commented Feb 17, 2021

looks good. Don't forget to add test case to JobWatcherTest.

Thanks. I have added test cases to JobWatcherTest in 4e59b9c and 4b5fad8 .

@rjeberhard rjeberhard merged commit 975ad94 into release/3.1.4 Feb 18, 2021
@ankedia ankedia deleted the fix_failed_job_check branch March 3, 2021 21:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants