Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support a new display status to distinguish "QUEUED" vs. "RUNNING" #204

Open
mbookman opened this issue Jul 16, 2020 · 3 comments
Open

Support a new display status to distinguish "QUEUED" vs. "RUNNING" #204

mbookman opened this issue Jul 16, 2020 · 3 comments

Comments

@mbookman
Copy link
Contributor

For historical reasons (initial support from the Pipelines API v1alpha2), dstat provides no distinction between tasks that are QUEUED vs. actually RUNNING. Today they are all listed as RUNNING.

We should surface this distinction in places where the status is displayed. It can be misleading to new users when they submit a large number of tasks, it may appear that some are taking a very long time to run when they are actually queued and blocked by other running tasks that are consuming Compute Engine Quota (such as CPU, Persistent Disk, or In-Use IP Addresses).

@mbookman
Copy link
Contributor Author

mbookman commented Jul 16, 2020

Until this is surfaced in dstat output, users can use one of:

to see the number of concurrently running VMs.

@sjgosai
Copy link

sjgosai commented Dec 26, 2022

Clarification question: Are jobs that are QUEUED due to resource quotas, but listed as RUNNING (i.e. Status: VM starting (awaiting worker checkin)) constrained by the same timeout restriction as jobs that are actually running?

@wnojopra
Copy link
Contributor

wnojopra commented Jan 3, 2023

Clarification question: Are jobs that are QUEUED due to resource quotas, but listed as RUNNING (i.e. Status: VM starting (awaiting worker checkin)) constrained by the same timeout restriction as jobs that are actually running?

Yes, the timeout includes the time spent waiting for a worker to be allocated. The default timeout is seven days which can be changed with the --timeout flag. See the provider specific parameters section in provider docs

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants