Add systemd uptime metric collection #952

langesven · 2018-05-24T10:03:26Z

The no longer available systemd_exporter provided uptime metrics for systemd services, as such I figured I'd propose this as an extension to node_epxorter.
This is somewhat relevant to #562 in that it exposes how long a service has been running for, not quite the timestamp of its start.
Exporting the duration makes it easier to define alerts on this metric in my opinion and would retain the functionality as provided by the previous project.

discordianfish · 2018-05-30T17:06:44Z

@langesven Thanks for the PR!
Though I think we should expose the start timestamp since it's much more efficient for prometheus to store. To alert on the duration, you can alert on time() - timestamp_metric.
See also this older, outdated PR with similar functionality: #709

langesven · 2018-06-04T10:40:33Z

Hi @discordianfish - fair enough, I changed it to a CounterValue that is either 0 or the unix timestamp of when the job was started. Or would a Gauge be better suited but still keep track of just the timestamp?

arianvp · 2018-06-25T06:29:02Z

Note that there is also this PR #709

discordianfish · 2018-06-25T18:06:05Z

@langesven After some back and forth we decided to use gauges for timestamps in general, so yeah please change this to a gauge.

discordianfish

Looking good in general, but we should also have some tests if possible.

discordianfish · 2018-06-25T18:04:26Z

collector/systemd_linux.go

@@ -55,6 +56,10 @@ func NewSystemdCollector() (Collector, error) {
 		prometheus.BuildFQName(namespace, subsystem, "unit_state"),
 		"Systemd unit", []string{"name", "state"}, nil,
 	)
+	unitStartedAtDesc := prometheus.NewDesc(
+		prometheus.BuildFQName(namespace, subsystem, "unit_started_at_seconds"),


Let's call this unit_start_time_seconds to be consistent with already existing process_start_time_seconds.

@discordianfish That won't work, because process_start_time_seconds metric family has no labels, so the collector will fail.

I think for consistency, we could name it unit_start_time_seconds.

@SuperQ That's what I suggested! Should have put it in ```` though :)

Sorry! I didn't read what you said closely enough. 🦆

discordianfish · 2018-07-03T08:37:41Z

@langesven Can you change the metric name? Beside this LGTM
/cc @SuperQ

langesven · 2018-07-05T13:03:59Z

Thanks for your feedback.
I changed the metric name to unit_start_time_seconds and made it a GaugeValue instead of a counter!

SuperQ

LGTM

SuperQ · 2018-07-05T13:46:48Z

Oh, would you mind adding a [FEATURE] entry to the changelog?

langesven · 2018-07-16T12:45:45Z

Added a feature changelog entry and rebased against master

discordianfish · 2018-07-16T16:29:06Z

@langesven Looks like this still (again?) needs rebasing

Signed-off-by: Sven Lange <tdl@hadiko.de>

langesven · 2018-07-17T11:28:50Z

@discordianfish it was indeed again 😄 rebased

* Add systemd uptime metric collection Signed-off-by: Sven Lange <tdl@hadiko.de>

langesven force-pushed the systemd-uptime-metrics branch from 41d45c1 to 8ae6035 Compare May 24, 2018 11:33

langesven force-pushed the systemd-uptime-metrics branch from 8ae6035 to 1c2a5a7 Compare June 4, 2018 10:38

discordianfish reviewed Jun 25, 2018

View reviewed changes

langesven force-pushed the systemd-uptime-metrics branch from 1c2a5a7 to 867690e Compare July 5, 2018 13:03

SuperQ approved these changes Jul 5, 2018

View reviewed changes

langesven force-pushed the systemd-uptime-metrics branch from 77fdded to 388b97b Compare July 16, 2018 12:42

langesven added 2 commits July 17, 2018 13:26

Add systemd uptime metric collection

32e9a76

Signed-off-by: Sven Lange <tdl@hadiko.de>

Add feature entry to CHANGELOG

7f505fb

Signed-off-by: Sven Lange <tdl@hadiko.de>

langesven force-pushed the systemd-uptime-metrics branch from 388b97b to 7f505fb Compare July 17, 2018 11:27

SuperQ merged commit 2ae8c1c into prometheus:master Jul 18, 2018

SuperQ mentioned this pull request Sep 11, 2018

systemd_collector should export timestamps service started at and last ran for timer #562

Closed

pgier mentioned this pull request Feb 6, 2019

Systemd refactor #1254

Merged

oblitorum pushed a commit to shatteredsilicon/node_exporter that referenced this pull request Apr 9, 2024

Add systemd uptime metric collection (prometheus#952)

19951cc

* Add systemd uptime metric collection Signed-off-by: Sven Lange <tdl@hadiko.de>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add systemd uptime metric collection #952

Add systemd uptime metric collection #952

langesven commented May 24, 2018

discordianfish commented May 30, 2018

langesven commented Jun 4, 2018

arianvp commented Jun 25, 2018

discordianfish commented Jun 25, 2018

discordianfish left a comment

discordianfish Jun 25, 2018

SuperQ Jul 3, 2018

SuperQ Jul 3, 2018

discordianfish Jul 5, 2018

SuperQ Jul 5, 2018

discordianfish commented Jul 3, 2018

langesven commented Jul 5, 2018

SuperQ left a comment

SuperQ commented Jul 5, 2018

langesven commented Jul 16, 2018

discordianfish commented Jul 16, 2018

langesven commented Jul 17, 2018

Add systemd uptime metric collection #952

Add systemd uptime metric collection #952

Conversation

langesven commented May 24, 2018

discordianfish commented May 30, 2018

langesven commented Jun 4, 2018

arianvp commented Jun 25, 2018

discordianfish commented Jun 25, 2018

discordianfish left a comment

Choose a reason for hiding this comment

discordianfish Jun 25, 2018

Choose a reason for hiding this comment

SuperQ Jul 3, 2018

Choose a reason for hiding this comment

SuperQ Jul 3, 2018

Choose a reason for hiding this comment

discordianfish Jul 5, 2018

Choose a reason for hiding this comment

SuperQ Jul 5, 2018

Choose a reason for hiding this comment

discordianfish commented Jul 3, 2018

langesven commented Jul 5, 2018

SuperQ left a comment

Choose a reason for hiding this comment

SuperQ commented Jul 5, 2018

langesven commented Jul 16, 2018

discordianfish commented Jul 16, 2018

langesven commented Jul 17, 2018