[service] Avoid failures of `service` resource with frequent restarts #469

olivielpeau · 2017-09-21T15:40:24Z

Fixes #467 (see description there)

Happens on systemd-based systems since systemd applies its limits on the number of restarts of a service (by default, 5 times starts every 10 seconds) on both the user-requested restarts and the ones systemd does on its own (when the service fails starting for instance). Root of the problem is that the service resource in `datadog_monitor` is different from the one in the main chef run (chef limitation), so the restarts that happen there are done immediately instead of being queued up nicely at the end of the global chef run. If multiple invocations of this resource are updated in a chef run the restart limit of systemd can be quickly reached. A better fix would be to remove the service definition from `datadog_monitor` and make all invocations of `datadog_monitor` notify the global `service[datadog-agent]` resource. This would be a breaking change, let's do it for the next major version.

olivielpeau added the bug label Sep 21, 2017

olivielpeau added this to the 2.11.0 milestone Sep 21, 2017

olivielpeau merged commit 4c201c5 into master Sep 21, 2017

olivielpeau deleted the olivielpeau/systemd-restart-retries branch September 21, 2017 15:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[service] Avoid failures of `service` resource with frequent restarts #469

[service] Avoid failures of `service` resource with frequent restarts #469

olivielpeau commented Sep 21, 2017

[service] Avoid failures of service resource with frequent restarts #469

[service] Avoid failures of service resource with frequent restarts #469

Conversation

olivielpeau commented Sep 21, 2017

[service] Avoid failures of `service` resource with frequent restarts #469

[service] Avoid failures of `service` resource with frequent restarts #469