Track number of tasks in executor as metric in scheduler job #29579

pdebelak · 2023-02-16T18:58:12Z

This ensures that the stats gauge for scheduler.tasks.running isn't always 0.

I was unsure how to set things up in a test to exercise this behavior, so any advice there would be welcome. This is based on how the code worked before the incrementing of num_tasks_in_executor was deleted, but if there is a better way to do this I'm happy to update.

Taragolis · 2023-02-18T15:05:04Z

The PR that added the HA Scheduler and removed the calculation of this metric, #10956, was merged 2 years ago, so I guess there is a very small chance that @ashb remembers why it was removed.

I just assume that, instead of checking whether each TI is queued or running, we could use len(self.executor.queued_tasks) + len(self.executor.running) as the value for this metric. But I'm not familiar with SchedulerJob, so it would be better to hear the opinion of someone who is more confident in this area than me.
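
For illustration, the suggestion above could be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the actual SchedulerJob code: `FakeExecutor`, `RecordingStats`, and `emit_tasks_running` are hypothetical names introduced here, and only `queued_tasks`, `running`, and the `scheduler.tasks.running` gauge name come from the discussion.

```python
from dataclasses import dataclass, field


@dataclass
class FakeExecutor:
    """Hypothetical stand-in for an executor; only the two collections matter."""

    queued_tasks: dict = field(default_factory=dict)
    running: set = field(default_factory=set)


class RecordingStats:
    """Hypothetical stats client that remembers the last value of each gauge."""

    def __init__(self):
        self.gauges = {}

    def gauge(self, name, value):
        self.gauges[name] = value


def emit_tasks_running(executor, stats):
    """Report queued plus running tasks held by this one executor."""
    value = len(executor.queued_tasks) + len(executor.running)
    stats.gauge("scheduler.tasks.running", value)
    return value


# Two queued task instances and one running task instance (made-up keys).
executor = FakeExecutor(
    queued_tasks={"ti_1": "cmd", "ti_2": "cmd"},
    running={"ti_3"},
)
stats = RecordingStats()
emit_tasks_running(executor, stats)
```

Computed this way, the value covers only the executor owned by the scheduler process that emits it.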

ashb · 2023-02-18T15:16:02Z

Memory of this is hazy - I guess the main issue is how you will behave if you have more than one scheduler

pdebelak · 2023-02-18T15:16:03Z

@Taragolis - it is also entirely possible that the right thing to do is just delete this metric and the docs for it, given that it has been wrong for so long.

Taragolis · 2023-02-18T16:37:48Z

That could be an option if we really do not know whether it is possible to get a more or less truthful value for this metric.

potiuk · 2023-02-20T20:24:29Z

> Memory of this is hazy - I guess the main issue is how you will behave if you have more than one scheduler

I think there should be a way to distinguish schedulers and have different metrics for them. But this is tricky with our approach where we can just literally start running yet-another-scheduler (and the bad thing in this context is that we do not even know how many schedulers we are running).

I'd be for dropping this metric altogether.

pierrejeambrun · 2023-02-24T21:01:26Z

I agree. This looks tricky to get right, and meanwhile it has been broken for a long time. I would be for dropping it.

kaxil · 2023-03-08T23:31:11Z

I will be up for removing it.

Separately, most (or all) of the scheduler metrics might suffer from the same problem when run in HA: each scheduler reports only its own number, with nothing identifying which scheduler it came from. Adding a suffix of scheduler_job_id to those metrics is a potential solution.
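
A rough sketch of the suffix idea, assuming a gauge backend where the last write for a given name wins. `RecordingStats`, `with_scheduler_suffix`, and the job ids 101 and 102 are hypothetical and used only for illustration; this is not an existing Airflow API.

```python
class RecordingStats:
    """Hypothetical stats client: the last write for a gauge name wins."""

    def __init__(self):
        self.gauges = {}

    def gauge(self, name, value):
        self.gauges[name] = value


def with_scheduler_suffix(metric, scheduler_job_id):
    """Append the scheduler job id so each scheduler gets its own series."""
    return f"{metric}.{scheduler_job_id}"


# Without a suffix, two schedulers write to the same name and
# the second report overwrites the first.
stats = RecordingStats()
stats.gauge("scheduler.tasks.running", 4)  # reported by one scheduler
stats.gauge("scheduler.tasks.running", 7)  # reported by another scheduler
unsuffixed = dict(stats.gauges)

# With a suffix, each scheduler's value is kept under its own name.
stats = RecordingStats()
stats.gauge(with_scheduler_suffix("scheduler.tasks.running", 101), 4)
stats.gauge(with_scheduler_suffix("scheduler.tasks.running", 102), 7)
suffixed = dict(stats.gauges)
```

The trade-off is that every new scheduler job id creates a new metric name, so dashboards would need to aggregate across names.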

potiuk · 2023-03-10T13:23:34Z

> I will be up for removing it.
>
> separately, most (or all) of the scheduler metrics might suffer from the problem when run in HA of reporting only their number without identifying what comes from where. Adding a suffix of scheduler_job_id to those metrics is a potential solution

Let's remove it then.

pdebelak requested review from XD-DENG, ashb and kaxil as code owners February 16, 2023 18:58

boring-cyborg bot added the area:Scheduler including HA (high availability) scheduler label Feb 16, 2023

Track number of tasks in executor as metric in scheduler job

15532a7

This ensures that the stats gauge for scheduler.tasks.running isn't always 0.

pdebelak force-pushed the tasks-running-gauge-increment branch from 1a6a7ff to 15532a7 Compare February 16, 2023 18:58

pdebelak closed this Mar 10, 2023

vincbeck mentioned this pull request Mar 30, 2023

Remove gauge scheduler.tasks.running #30374

Merged
