fix(cron): Log long running jobs #45804

ChristophWurst · 2024-06-12T07:57:46Z

Resolves: production instance with database transactions spanning for days. One theory is that it's a stuck cron job.

Summary

If cron jobs take a very long time to complete they will start to run in parallel. That's because jobs are only reserved for 12h. Afterwards we just assume that the jobs failed and start the job again. In faulty situations that can lead to more and more server load.

Here is an example:

It looks like a job executes an expensive query over and over. After 12h the query time doubles, after 24h it triplicates, after 36h it quadruples, etc. It's not clear if that is what is really happening.

I've tried to be reasonable with the log level so we don't spam the logs too much:

Job executes longer than 5m -> debug
Job executes longer than 20m -> info
Job executes longer than 1h20m -> warning
Job executes longer than 5h20m -> error
Job executes longer than 10h40m -> fatal

TODO

Add logging

Checklist

Code is properly formatted
Sign-off message is added to all commits
Tests (unit, integration, api and/or acceptance) are included
Screenshots before/after for front-end changes
Documentation (manuals or wiki) has been updated or is not required
Backports requested where applicable (ex: critical bugfixes)

Signed-off-by: Christoph Wurst <christoph@winzerhof-wurst.at>

ChristophWurst · 2024-06-12T10:41:14Z

/backport to stable29

ChristophWurst · 2024-06-12T10:41:19Z

/backport to stable28

ChristophWurst · 2024-06-12T10:41:41Z

/backport to stable27

ChristophWurst · 2024-06-13T10:38:08Z

/backport to stable27

AndyScherzinger · 2024-06-28T12:32:04Z

/backport to stable28

marinofaggiana · 2024-07-23T15:11:52Z

@AndyScherzinger can we have a back port for 25 ?

AndyScherzinger · 2024-07-23T15:23:59Z

/backport to stable26

AndyScherzinger · 2024-07-23T15:24:03Z

/backport to stable25

AndyScherzinger · 2024-07-23T15:24:27Z

@marinofaggiana I don't know, let's try and see if the bot can create them 🤞

AndyScherzinger · 2024-07-23T15:29:22Z

So PR could be created but is incomplete according to the bot #46706 @marinofaggiana - best you align with @ChristophWurst to have them wrapped up for 25 and 26

marinofaggiana · 2024-07-23T15:30:12Z

ok

ChristophWurst · 2024-07-24T11:56:35Z

Let's try porting from 27, where I have already had to resolve conflicts: #45855 (comment).

ChristophWurst · 2024-07-24T11:59:27Z

Worked. @AndyScherzinger @marinofaggiana if you need more backports use stable27 as base

fix(cron): Log long running jobs

7ea6eac

Signed-off-by: Christoph Wurst <christoph@winzerhof-wurst.at>

ChristophWurst added bug 3. to review Waiting for reviews labels Jun 12, 2024

ChristophWurst added this to the Nextcloud 30 milestone Jun 12, 2024

ChristophWurst requested review from nickvergessen, joshtrichards, juliusknorr, kesselb and solracsf June 12, 2024 07:57

ChristophWurst self-assigned this Jun 12, 2024

AndyScherzinger approved these changes Jun 12, 2024

View reviewed changes

kesselb approved these changes Jun 12, 2024

View reviewed changes

ChristophWurst merged commit 8e3a049 into master Jun 12, 2024
164 checks passed

ChristophWurst deleted the fix/cron/log-long-running-jobs branch June 12, 2024 10:07

backportbot bot added the backport-request label Jun 12, 2024

backportbot bot mentioned this pull request Jun 12, 2024

[stable29] fix(cron): Log long running jobs #45813

Merged

backportbot bot removed the backport-request label Jun 12, 2024

backportbot bot mentioned this pull request Jun 12, 2024

[stable17] fix(cron): Log long running jobs #45814

Closed

2 tasks

backportbot bot added the backport-request label Jun 13, 2024

backportbot bot mentioned this pull request Jun 13, 2024

[stable27] fix(cron): Log long running jobs #45855

Merged

2 tasks

backportbot bot removed the backport-request label Jun 13, 2024

ChristophWurst mentioned this pull request Jun 13, 2024

[Bug]: Cron runs forever #43201

Closed

8 tasks

backportbot bot added the backport-request label Jun 28, 2024

backportbot bot mentioned this pull request Jun 28, 2024

[stable28] fix(cron): Log long running jobs #46191

Merged

2 tasks

backportbot bot removed the backport-request label Jun 28, 2024

backportbot bot added the backport-request label Jul 23, 2024

backportbot bot mentioned this pull request Jul 23, 2024

[stable26] fix(cron): Log long running jobs #46705

Closed

2 tasks

backportbot bot removed the backport-request label Jul 23, 2024

backportbot bot mentioned this pull request Jul 23, 2024

[stable25] fix(cron): Log long running jobs #46706

Closed

2 tasks

blizzz mentioned this pull request Jul 24, 2024

30.0.0 beta 1 #46713

Merged

This was referenced Aug 7, 2024

[stable25] fix(cron): Log long running jobs #46716

Closed

cron: Long running jobs - Memory & cpu footprint will increase continously #47132

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(cron): Log long running jobs #45804

fix(cron): Log long running jobs #45804

ChristophWurst commented Jun 12, 2024 •

edited

Loading

ChristophWurst commented Jun 12, 2024

ChristophWurst commented Jun 12, 2024

ChristophWurst commented Jun 12, 2024

ChristophWurst commented Jun 13, 2024

AndyScherzinger commented Jun 28, 2024

marinofaggiana commented Jul 23, 2024

AndyScherzinger commented Jul 23, 2024

AndyScherzinger commented Jul 23, 2024

AndyScherzinger commented Jul 23, 2024

AndyScherzinger commented Jul 23, 2024

marinofaggiana commented Jul 23, 2024

ChristophWurst commented Jul 24, 2024

ChristophWurst commented Jul 24, 2024

fix(cron): Log long running jobs #45804

fix(cron): Log long running jobs #45804

Conversation

ChristophWurst commented Jun 12, 2024 • edited Loading

Summary

TODO

Checklist

ChristophWurst commented Jun 12, 2024

ChristophWurst commented Jun 12, 2024

ChristophWurst commented Jun 12, 2024

ChristophWurst commented Jun 13, 2024

AndyScherzinger commented Jun 28, 2024

marinofaggiana commented Jul 23, 2024

AndyScherzinger commented Jul 23, 2024

AndyScherzinger commented Jul 23, 2024

AndyScherzinger commented Jul 23, 2024

AndyScherzinger commented Jul 23, 2024

marinofaggiana commented Jul 23, 2024

ChristophWurst commented Jul 24, 2024

ChristophWurst commented Jul 24, 2024

ChristophWurst commented Jun 12, 2024 •

edited

Loading