-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] - Argo-Workflow pods linger after completed workflows on GCP #1573
Comments
@iameskild Is this the hello world example you ran? apiVersion: argoproj.io/v1alpha1
kind: Workflow
metadata:
generateName: hello-world-
labels:
workflows.argoproj.io/archive-strategy: "false"
annotations:
workflows.argoproj.io/description: |
This is a simple hello world example.
You can also run it in Python: https://couler-proj.github.io/couler/examples/#hello-world
spec:
entrypoint: whalesay
templates:
- name: whalesay
container:
image: docker/whalesay:latest
command: [cowsay]
args: ["hello world"] When I run this via the Argo UI, I see no log output, even after 15 minutes, so I'm curious why you have log messages and I don't. Maybe you have elevated permissions. It seems that the workflow completed for you, but not for me. |
Hey @ericdatakelly I have been submitting the |
@iameskild FYI: I ran the same workflow and waited 20 minutes, but I still see no logs. (for anyone following along, the workflow is the default workflow that is populated when the manifest editor is opened) |
After looking into this some more, I believe the issue lies with the The docs for the workflow executors also call out that the After some more testing, I will open a PR for this fix. |
Thanks @iameskild ! I just tried the default (hello argo) workflow and it shows that it succeeded. I think you are ok to open the PR unless you want to wait for me to make a custom workflow and test that. |
Thanks @ericdatakelly! I just opened the PR for the fix 👍 |
Describe the bug
When running a workflow via Argo-Workflows, the associated pod seems to linger even after the workflow successfully completed.
Expected behavior
Once the workflow is completed (either failed or successful), the associated pod should stop running as well.
OS and architecture in which you are running Nebari
GCP GKE
How to Reproduce the problem?
Run the
hello-argo
example workflow from the/argo
UI. This seems to be only happening on Nebari clusters running on GCP.Command output
Versions and dependencies used.
No response
Compute environment
No response
Integrations
No response
Anything else?
By looking at
k9s
, the workflow pod it appears that only 1 of 2 init containers completed before the workflow started. This second init container might be waiting for a signal that it will never receive.The text was updated successfully, but these errors were encountered: