Added metrics to Scheduler and track in Process state #984

AndersGM · 2023-06-19T20:20:24Z

bensheldon

I'm happy to accept this. Briefly, it needs:

threadsafety
have the value show up when the Process serializes the Schedulers so that it can be queried
I think the value should be reset when the the Scheduler is restarted (otherwise it might get weird when forking, especially with Puma's worker_fork

lib/good_job/metrics.rb

AndersGM · 2023-06-20T07:54:08Z

@bensheldon Thank you for taking your time to look into this! Regarding the serialization - as far as I can see process uses name when serializing, so I've added the metrics to the name here in 78d8aba. Not sure if that is the best place though, but not sure where else to put it, without messing up the enclosing parentheses. What do you think? Also considered adding something like:

succeeded_count: GoodJob::Scheduler.instances.sum(&:succeeded_count),
failed_count: GoodJob::Scheduler.instances.sum(&:failed_count),

to Process.current_state.

bensheldon · 2023-06-20T21:54:31Z

@AndersGM Good things to flag! What do you think about making a method like Scheduler#statistics that contained the name and those values and then dropping that into the Process state? (I also think they should be succeeded_executions_count for clarity).

Ok, and then I have a totally random thought that I want to share, but isn't telling you to do instead: What if we tracked executed_by_process_id on the Execution/DiscreteExecution object in the database? Would that be equivalent? (it could be joined in the database against the Process record)

AndersGM · 2023-06-20T22:51:47Z

@bensheldon That would be better, but maybe not statistics since there already is a method named stats so it might be confusing. What about name_with_metrics or just adding a parameter include_metrics to the name method? Agree about the more longer more explicit variable names (succeeded_executions_count).

Sound interesting. I actually do something similar in one of my apps that then sends the metrics to prometheus. However, the issue is that the executions can be destroyed through the UI (and cleanup?). Consequently, the resulting counter could actually decrement, without being reset. That is the reason I implemented it this way, to avoid that. Did you think of another way or have I misunderstood something? :-)

app/models/good_job/process.rb

bensheldon · 2023-07-01T17:51:37Z

@AndersGM I made a few tweaks and pushed this back up. It's looking great!

I just merged in #977 which will update the Process record in the database every 30 seconds.

One thing I noticed that I'd like your feedback on. Currently this PR only records a job failure if it bubbles up to the thread-handler. This will cover only a small number of execution failures. We want to cover all the circumstances where a job's execution does not succeed, right? (e.g. if an execution fails and is retried, we'd still count that). That seem right? If you agree, I can make that tweak.

AndersGM · 2023-07-01T18:25:15Z

@bensheldon indeed - thank you! I wasn't aware of that - let me know if I can be of any help!

lib/good_job/metrics.rb

app/models/good_job/execution_result.rb

AndersGM force-pushed the metrics branch 2 times, most recently from 9173934 to d6e2587 Compare June 19, 2023 20:27

added metrics

b657d1f

AndersGM force-pushed the metrics branch from d6e2587 to b657d1f Compare June 19, 2023 21:02

bensheldon reviewed Jun 19, 2023

View reviewed changes

lib/good_job/metrics.rb Outdated Show resolved Hide resolved

AndersGM added 5 commits June 20, 2023 09:04

thread safe counters

3111a28

reset metrics after restart

1f3ae5f

specced both error and success in #task_observer

2960a86

added doc

30db43a

include metrics in name

78d8aba

Merge branch 'main' into metrics

042ed03

AndersGM added 3 commits June 25, 2023 20:34

renamed and reverted

17e5bd7

Merge remote-tracking branch 'fork/metrics' into metrics

02dae4a

consistent naming

afeae87

AndersGM commented Jun 25, 2023

View reviewed changes

app/models/good_job/process.rb Show resolved Hide resolved

AndersGM added 2 commits June 25, 2023 20:54

fixe broken current state

2c3c21e

Merge branch 'main' into metrics

6069cac

AndersGM marked this pull request as ready for review June 25, 2023 18:56

AndersGM requested a review from bensheldon June 25, 2023 18:56

bensheldon added 2 commits July 1, 2023 10:23

Merge remote-tracking branch 'origin/main' into metrics

822a0be

Track Scheduler name and queues separately in stats and display

e9a26d2

Fix Scheduler tests

28b99ad

bensheldon added 3 commits July 1, 2023 11:36

Remove default value for state

6c9af51

Renamed failed_ to errored_

3ca81d4

Track empty and unlocked executions too

e59c8da

bensheldon added 4 commits July 1, 2023 12:39

Have stat totals add up consistently

74ad726

Ensure stats can be reported when shutdown

6fe747a

Don't add object to instances list until fully initialized

169a23c

Fix race condition in Scheduler#stats test

0964768

AndersGM commented Jul 2, 2023

View reviewed changes

lib/good_job/metrics.rb Show resolved Hide resolved

app/models/good_job/execution_result.rb Outdated Show resolved Hide resolved

Rename "unlocked" to "unexecutable" and fix counting

b8f6f08

bensheldon added the enhancement New feature or request label Jul 4, 2023

bensheldon changed the title ~~Added metrics to scheduler~~ Added metrics to scheduler and track in Process state Jul 4, 2023

bensheldon changed the title ~~Added metrics to scheduler and track in Process state~~ Added metrics to Scheduler and track in Process state Jul 4, 2023

bensheldon merged commit caf8005 into bensheldon:main Jul 4, 2023

AndersGM deleted the metrics branch July 4, 2023 21:27

AndersGM mentioned this pull request Jul 7, 2023

Add debugging and telemetry stats #532

Open

AndersGM mentioned this pull request Jan 25, 2024

Added basic GoodJob collectors discourse/prometheus_exporter#280

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added metrics to Scheduler and track in Process state #984

Added metrics to Scheduler and track in Process state #984

AndersGM commented Jun 19, 2023 •

edited

Loading

bensheldon left a comment

AndersGM commented Jun 20, 2023 •

edited

Loading

bensheldon commented Jun 20, 2023

AndersGM commented Jun 20, 2023 •

edited

Loading

bensheldon commented Jul 1, 2023

AndersGM commented Jul 1, 2023

Added metrics to Scheduler and track in Process state #984

Added metrics to Scheduler and track in Process state #984

Conversation

AndersGM commented Jun 19, 2023 • edited Loading

bensheldon left a comment

Choose a reason for hiding this comment

AndersGM commented Jun 20, 2023 • edited Loading

bensheldon commented Jun 20, 2023

AndersGM commented Jun 20, 2023 • edited Loading

bensheldon commented Jul 1, 2023

AndersGM commented Jul 1, 2023

AndersGM commented Jun 19, 2023 •

edited

Loading

AndersGM commented Jun 20, 2023 •

edited

Loading

AndersGM commented Jun 20, 2023 •

edited

Loading