-
Notifications
You must be signed in to change notification settings - Fork 702
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add warn event and directly return without creating pods for job validation failure #1564
Add warn event and directly return without creating pods for job validation failure #1564
Conversation
Hi @cheimu. Thanks for your PR. I'm waiting for a kubeflow member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Pull Request Test Coverage Report for Build 2039519742
💛 - Coveralls |
/ok-to-test /assign @zw0610 |
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: cheimu The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Generally LGTM. It would be better if test cases are included as well. |
Oh yeah, you are right, let me add tests |
/retest |
Done |
/retest same reason. |
…thub.com/cheimu/training-operator into fix/tfjob-add-warn-for-failed-validation
@cheimu: The following test failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
Signed-off-by: Peng Gao <peng.gao.dut@gmail.com>
* Deprecate training-operator presubmit on optional-test-infra This PR serves as sub-PR to deprecate training-operator presubmit on optional-test-infra. * Update config file Update workflow format
* Adding latest image tag * Update manifests with latest image tag * Adding integration tests * Change trigger type
Currently certain operations like tailing logs from the Python sdk against the latest version of the operator due to a label mistmatch; fix that. Closes kubeflow#1587.
* Adding latest image tag * Update manifests with latest image tag * Update k8s dependencies to v0.24.1 * Update manifests * Update k8s matrix for integration tests * Update k8s matrix for unit tests * Fix k8s versions * Fix version * Add scripts in separate file * Fix Makefile * Cleanup Makefile * Addressing review comments
/retest |
@johnugeorge Could you advise how to handle stalled pull requests because of the failed ci like this one after the migration to GHA? |
it should trigger new GHA workflow on a rebase @cheimu Can you rebase? |
…thub.com/cheimu/training-operator into fix/tfjob-add-warn-for-failed-validation
:( sad for myself for a second. I'll open a new pr and close this one later... |
@cheimu can you do it at the earliest as we are planning to have feature release by end of next week? |
Hi @johnugeorge , I'm afraid I can't make it this week... I'll try my best though |
Closing it in favour of #1704 |
@johnugeorge: Closed this PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
What this PR does / why we need it:
Currently, all jobs' controllers validate job spec but only log an error in
training-operator
, butWhich issue(s) this PR fixes (optional, in
Fixes #<issue number>, #<issue number>, ...
format, will close the issue(s) when PR gets merged):Fixes #1563
Checklist: