pkg/alertmanager: Use lower value for --cluster.reconnect-timeout
In a highly dynamic environment like Kubernetes, Alertmanager pods can come and go at frequent intervals. The default timeout of 6h is not suitable in that case, because Alertmanager keeps trying to reconnect to a pod that no longer exists until it gives up and goes through another DNS resolution. It is therefore better to use a lower value, which allows the Alertmanager cluster to recover after an update, rollout, or similar process in the Kubernetes cluster.

Related: prometheus/alertmanager#2250