kubectl wait for a non-existent resource #1516

Comments
/sig cli
I can take a look into this. /assign @rikatz
@kvokka just a question: are deleted conditions also something you're looking for? I imagine a situation where you want a 'deleted' condition for an object that wasn't even created. This seems pretty strange to me :) but let me know if this is also a scenario. Tks
@rikatz Thank you for your response! For me, a simple wait until timeout is more than enough. If the developer wants to control the object's persistence/deletion, just let them do it. Sounds reasonable? The example scenario of the expected behaviour is described in this article.
Right. It might be trickier than I thought, but I'm already taking a look. The biggest problem is that the function used to 'visit' an object expects it to exist (ResourceFinder.Do().Visit), so I'm taking a look to check whether it's possible to 'bypass/loop' into it ;)
I've made an initial and dirty PR just to see if this is the path to follow :D
Thank you for the contribution! I hope the code will be reviewed and merged soon! :)
OK, so I need some review :/ What I did is the dumbest way: a sleep of 1s. I'm not sure why ResourceFinder is used here, or whether something more "flexible" could be used, so I need someone with more experience in the Kubernetes code to review that.
Issues go stale after 90d of inactivity. Mark the issue as fresh with /remove-lifecycle stale. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. /lifecycle stale
/remove-lifecycle stale
Got some time to resolve some other stuff, but this is still a thing.
Issues go stale after 90d of inactivity. Mark the issue as fresh with /remove-lifecycle stale. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. /lifecycle stale
/remove-lifecycle stale
Had the same problem. I am using a script to create all my k8s objects, then wait for a particular pod to be ready. I have a race condition: when the wait executes, the object apparently is not yet created. This issue forces me to sleep for a few seconds before issuing the wait.

kubectl apply -f foo.yaml
sleep 5 # Just to avoid the wait error below
kubectl -n ns wait pod --for=condition=ready -l name=pod --timeout=120s
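A hedged alternative to the fixed sleep is to poll until at least one matching pod exists before handing off to kubectl wait. This is only a minimal sketch reusing the namespace and label from the snippet above; it has no upper bound on the polling (the Bash workaround in a later comment adds one):

kubectl apply -f foo.yaml
# poll (instead of a blind sleep) until at least one pod with the label exists
until kubectl -n ns get pod -l name=pod -o name | grep -q .; do
  sleep 1
done
kubectl -n ns wait pod --for=condition=ready -l name=pod --timeout=120s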
/remove-lifecycle stale
Here is my current workaround using Bash. It differs from the other workarounds published here in that it has a max wait time. It uses the return code from kubectl describe to detect whether the Pod exists yet.

# Wait for Pods with a certain name prefix to exist.
# The wait is over when at least one Pod exists or the max wait time is reached.
#
# $1 : namespace
# $2 : pod name prefix (wildcards not allowed)
# $3 : the maximum time to wait in seconds
#
# The command is useful in combination with 'kubectl wait', which can wait for a certain condition,
# but cannot wait for existence.
wait_for_pods_to_exist() {
  local ns=$1
  local pod_name_prefix=$2
  local max_wait_secs=$3
  local interval_secs=2
  local start_time
  start_time=$(date +%s)
  while true; do
    current_time=$(date +%s)
    if (( (current_time - start_time) > max_wait_secs )); then
      echo "Waited for pods in namespace \"$ns\" with name prefix \"$pod_name_prefix\" to exist for $max_wait_secs seconds without luck. Returning with error."
      return 1
    fi
    if kubectl -n "$ns" describe pod "$pod_name_prefix" --request-timeout "5s" &> /dev/null; then
      break
    else
      sleep "$interval_secs"
    fi
  done
}
...and use it like this in combination with kubectl wait:

# wait up to 20 secs for Pod to exist:
wait_for_pods_to_exist "mynamespace" "mypodname" 20
# ...and then wait for state 'Ready' for up to 30 secs:
kubectl --namespace "mynamespace" wait --for=condition=ready pod/mypodname --timeout=30s

(I haven't found a need to wait for any other resource type than pods, but the above can easily be generalized.) I truly believe that 'wait-for-existence' should be something kubectl supports out of the box.
Programmatically, it is easier done in code.
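Since the comment notes the approach can easily be generalized, here is a sketch of a generalized variant for arbitrary resource kinds. The function name and argument layout are my own invention, not part of the original workaround:

# Wait for a resource of a given kind and name to exist, with a max wait time.
# $1 : namespace
# $2 : resource kind (e.g. "pod", "deployment", "job")
# $3 : resource name
# $4 : maximum time to wait in seconds
wait_for_resource_to_exist() {
  local ns=$1 kind=$2 name=$3 max_wait_secs=$4
  local interval_secs=2
  local start_time
  start_time=$(date +%s)
  while true; do
    if (( $(date +%s) - start_time > max_wait_secs )); then
      echo "Timed out after $max_wait_secs seconds waiting for $kind/$name in namespace \"$ns\" to exist." >&2
      return 1
    fi
    # 'kubectl get' returns non-zero while the resource does not exist yet
    if kubectl -n "$ns" get "$kind" "$name" --request-timeout "5s" &> /dev/null; then
      return 0
    fi
    sleep "$interval_secs"
  done
}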
@lbruun thx a lot for the script. A little better could be to use the following instead of waiting only for the ready state:
This waits for the pod to be completely ready. When waiting only for the ready state, I had problems because it was not completely deployed.
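One way to wait for a workload to be "completely deployed" rather than just pod-Ready is to wait on the owning workload's rollout. This is only a sketch assuming the pods belong to a Deployment named myapp in namespace mynamespace (both hypothetical names, not taken from the comment):

kubectl -n mynamespace rollout status deployment/myapp --timeout=120s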
It seems that there is high demand for this feature, and I'd like to take a shot at proposing a viable solution. For backwards compatibility, it would be better to introduce a new flag for this functionality and keep the current behavior without it. /assign
/triage accepted
@ardaguclu - any update? Thanks :)
Thanks for the reminder; I had forgotten about this issue :). I'll prioritize it.
This would be great to have. I just ran into this and had to cook up some ugly bash scripting.
@seastco as you can see, this issue has already been fixed and closed. It's available in v1.32.0-alpha.0, v1.31.1, v1.31.0, v1.31.0-rc.1, v1.31.0-rc.0, v1.31.0-beta.0, and v1.31.0-alpha.3; see commit kubernetes/kubernetes@b95fce1.
Note that kubernetes/kubernetes@e24b9a0 (the actual commit implementing this feature) was reverted with kubernetes/kubernetes#125630, and the follow-up attempt (kubernetes/kubernetes#125632) was closed with the plan to instead expand the existing kubectl wait --for conditions. So this issue probably should be reopened.
kubernetes/kubernetes#125868 (kubectl wait --for=create) was the final decision and can be used.
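For reference, a minimal sketch of the resulting usage on kubectl v1.31 or newer; the resource name, namespace, and timeouts here are placeholders, not taken from the thread:

# First wait for the object to exist at all (supported since kubectl 1.31)...
kubectl -n mynamespace wait pod/mypod --for=create --timeout=60s
# ...then wait for the condition you actually care about.
kubectl -n mynamespace wait pod/mypod --for=condition=Ready --timeout=120s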
In the UDN tests we were using the wait command to poll the condition of the UDN to see if it is ready or created. However, if the UDN doesn't exist, wait won't poll and retry; it will immediately exit because it assumes the resource already exists. See kubernetes/kubectl#1516 for details. The recommended fix is to use kubernetes/kubernetes#125868, which was added in Kubernetes 1.31. This PR changes that.

Found during CI debugging. See sample flake: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-release-master-nightly-4.19-e2e-aws-ovn-cgroupsv1-techpreview/1877195896546922496
blob:https://prow.ci.openshift.org/b8f5e892-9512-4f54-9176-772d131df102

STEP: create tests UserDefinedNetwork @ 01/09/25 04:54:51.335
I0109 04:54:51.335566 82622 builder.go:121] Running '/usr/bin/kubectl --server=https://api.ci-op-i9rwlkcc-1d795.aws-2.ci.openshift.org:6443 --kubeconfig=/tmp/kubeconfig-916539295 --namespace=e2e-test-network-segmentation-e2e-xxcxf create -f /tmp/udn-test1009896207/test-ovn-k-udn-hr84p.yaml'
I0109 04:54:51.683868 82622 builder.go:146] stderr: ""
I0109 04:54:51.683898 82622 builder.go:147] stdout: "userdefinednetwork.k8s.ovn.org/test-net created\n"
I0109 04:54:51.684017 82622 builder.go:121] Running '/usr/bin/kubectl --server=https://api.ci-op-i9rwlkcc-1d795.aws-2.ci.openshift.org:6443 --kubeconfig=/tmp/kubeconfig-916539295 --namespace=e2e-test-network-segmentation-e2e-xxcxf wait userdefinednetwork test-net --for condition=NetworkReady=True --timeout 5s'
I0109 04:54:52.088420 82622 builder.go:135] rc: 1
[FAILED] in [BeforeEach] - github.com/openshift/origin/test/extended/networking/network_segmentation.go:568 @ 01/09/25 04:54:52.088
STEP: Collecting events from namespace "e2e-test-network-segmentation-e2e-xxcxf". @ 01/09/25 04:54:52.088
STEP: Found 0 events. @ 01/09/25 04:54:52.118
I0109 04:54:52.136664 82622 resource.go:168] POD NODE PHASE GRACE CONDITIONS
I0109 04:54:52.136718 82622 resource.go:178]
I0109 04:54:52.199221 82622 dump.go:81] skipping dumping cluster info - cluster too large
I0109 04:54:52.272872 82622 client.go:638] Deleted {user.openshift.io/v1, Resource=users e2e-test-network-segmentation-e2e-xxcxf-user}, err: <nil>
I0109 04:54:52.322560 82622 client.go:638] Deleted {oauth.openshift.io/v1, Resource=oauthclients e2e-client-e2e-test-network-segmentation-e2e-xxcxf}, err: <nil>
I0109 04:54:52.365090 82622 client.go:638] Deleted {oauth.openshift.io/v1, Resource=oauthaccesstokens sha256~s7HMlKu1k8aprv5mU8JP_uzaAAuujg4_SOH_ds56VPk}, err: <nil>
STEP: Collecting events from namespace "e2e-test-monitoring-collection-profiles-vcqhj". @ 01/09/25 04:54:52.365
STEP: Found 0 events. @ 01/09/25 04:54:52.399
I0109 04:54:52.432784 82622 resource.go:168] POD NODE PHASE GRACE CONDITIONS
I0109 04:54:52.432808 82622 resource.go:178]
I0109 04:54:52.461376 82622 dump.go:81] skipping dumping cluster info - cluster too large
I0109 04:54:52.487845 82622 client.go:638] Deleted {user.openshift.io/v1, Resource=users e2e-test-monitoring-collection-profiles-vcqhj-user}, err: <nil>
I0109 04:54:52.536344 82622 client.go:638] Deleted {oauth.openshift.io/v1, Resource=oauthclients e2e-client-e2e-test-monitoring-collection-profiles-vcqhj}, err: <nil>
I0109 04:54:52.599493 82622 client.go:638] Deleted {oauth.openshift.io/v1, Resource=oauthaccesstokens sha256~hBHLTqI3PVJQevO8vllBEBy8FwOsd3l1fvTZcYz9Xxw}, err: <nil>
STEP: Destroying namespace "e2e-test-network-segmentation-e2e-xxcxf" for this suite. @ 01/09/25 04:54:52.599
STEP: Destroying namespace "e2e-test-monitoring-collection-profiles-vcqhj" for this suite. @ 01/09/25 04:54:52.62
• [FAILED] [4.347 seconds]
[sig-network][OCPFeatureGate:NetworkSegmentation][Feature:UserDefinedPrimaryNetworks] when using openshift ovn-kubernetes UserDefinedNetwork [BeforeEach] pod connected to UserDefinedNetwork cannot be deleted when being used [Suite:openshift/conformance/parallel]
[BeforeEach] github.com/openshift/origin/test/extended/networking/network_segmentation.go:563
[It] github.com/openshift/origin/test/extended/networking/network_segmentation.go:612
[FAILED] Expected success, but got an error:
<exec.CodeExitError>:
error running /usr/bin/kubectl --server=https://api.ci-op-i9rwlkcc-1d795.aws-2.ci.openshift.org:6443 --kubeconfig=/tmp/kubeconfig-916539295 --namespace=e2e-test-network-segmentation-e2e-xxcxf wait userdefinednetwork test-net --for condition=NetworkReady=True --timeout 5s:
Command stdout:
stderr:
Error from server (NotFound): userdefinednetworks.k8s.ovn.org "test-net" not found
error: exit status 1
{
    Err: <*errors.errorString | 0xc001de3990>{
        s: "error running /usr/bin/kubectl --server=https://api.ci-op-i9rwlkcc-1d795.aws-2.ci.openshift.org:6443 --kubeconfig=/tmp/kubeconfig-916539295 --namespace=e2e-test-network-segmentation-e2e-xxcxf wait userdefinednetwork test-net --for condition=NetworkReady=True --timeout 5s:\nCommand stdout:\n\nstderr:\nError from server (NotFound): userdefinednetworks.k8s.ovn.org \"test-net\" not found\n\nerror:\nexit status 1",
    },
    Code: 1,
}

As you can see, the UDN was created and applied, yet when it was fetched it returned "not found", and consequently we quit immediately instead of retrying.

Signed-off-by: Surya Seetharaman <suryaseetharaman.9@gmail.com>
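Applied to the failing test above, the recommended fix amounts to a two-step wait. This is only a sketch reusing the resource name, namespace, and condition from the log; the timeout values are illustrative:

# Step 1: wait for the UserDefinedNetwork object to exist (requires kubectl 1.31+).
kubectl -n e2e-test-network-segmentation-e2e-xxcxf wait userdefinednetwork test-net --for=create --timeout=30s
# Step 2: once it exists, wait for it to become ready.
kubectl -n e2e-test-network-segmentation-e2e-xxcxf wait userdefinednetwork test-net --for=condition=NetworkReady=True --timeout=5s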
What happened:
I cannot avoid exiting with an error for a resource that has not been created yet, which is misleading given the command name:
kubectl wait --selector=foo=bar --for=condition=complete jobs
kubectl wait --for=condition=complete jobs/foo
Even if this behavior is intentional, the user should have an option to continue waiting instead of exiting with an error code.
In contrast, if the resource already exists, everything works as it should.
What you expected to happen:
kubectl should at least have the ability (an option) to wait for a resource that does not exist yet.
Anything else we need to know?:
Related: kubernetes/kubernetes#75227
Environment:
- Kubernetes version (kubectl version): 1.15.2
- OS (e.g. cat /etc/os-release): macOS 10.14.6
- Kernel (e.g. uname -a): Darwin Kernel Version 18.7.0