
Export systemd's own metrics #709

Closed
wants to merge 1 commit

Conversation


@arianvp arianvp commented Oct 21, 2017

Systemd does all kinds of accounting when IOAccounting, CPUAccounting
and IPAccounting are enabled. This commit exposes some of these.
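For anyone trying this locally: these properties only carry data once accounting is switched on for the unit (or globally via DefaultCPUAccounting= and friends in system.conf). A minimal drop-in might look like this (unit name hypothetical):

```ini
# /etc/systemd/system/myservice.service.d/10-accounting.conf
# Hypothetical drop-in enabling the accounting that backs these metrics.
[Service]
CPUAccounting=yes
IOAccounting=yes
IPAccounting=yes
MemoryAccounting=yes
```

After `systemctl daemon-reload` and a restart of the unit, properties such as CPUUsageNSec and IPIngressPackets start being populated.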

Also, systemd keeps track of how many times services restart, when
their state changes happen, and a lot more. These metrics are very
useful for detecting problems with running applications.

Still need to write tests, and add some missing metrics so not finished yet.

But I welcome any feedback!


arianvp commented Oct 21, 2017

I'll also fix #562 while I'm at it.


arianvp commented Oct 21, 2017

While fixing #562, I discovered systemd exports the required metrics in two flavors: wall-clock time and monotonic time. I guess monotonic is preferred, because then we can export it as a counter, right?

```go
},
"NRestarts": metric{
	desc: prometheus.NewDesc(
		prometheus.BuildFQName(namespace, subsystem, "nrestarts"),
```

Probably n_restarts?


Counters should always end in _total, so this would be restarts_total
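For illustration: prometheus.BuildFQName simply joins the non-empty parts with underscores, so the suggested rename would come out as node_systemd_restarts_total. A stdlib-only sketch of that behavior (the helper below is hypothetical, not the real client_golang code):

```go
package main

import (
	"fmt"
	"strings"
)

// buildFQName mimics prometheus.BuildFQName: join the non-empty
// name parts with underscores. Sketch for illustration only.
func buildFQName(namespace, subsystem, name string) string {
	parts := []string{}
	for _, p := range []string{namespace, subsystem, name} {
		if p != "" {
			parts = append(parts, p)
		}
	}
	return strings.Join(parts, "_")
}

func main() {
	// Counters should end in _total, per the review comment above.
	fmt.Println(buildFQName("node", "systemd", "restarts_total"))
	// node_systemd_restarts_total
}
```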

```go
func (c *systemdCollector) collectUnitProperiesMetrics(ch chan<- prometheus.Metric) error {
	conn, err := c.newDbus()
	if err != nil {
		return fmt.Errorf("couldn't get dbus connection: %s", err)
```

@flokli flokli Oct 21, 2017


This should probably start with an uppercase letter.

```go
desc: prometheus.NewDesc(
	prometheus.BuildFQName(namespace, subsystem, "ip_ingress_packets_total"),
	"Ingress packets total",
	[]string{"name"},
```


name is too generic; if this is the name of the unit, I would suggest unit or unit_name.


SuperQ commented Oct 21, 2017

This seems like an interesting feature, but possibly overlaps too much with cAdvisor? We've been avoiding having the node_exporter duplicate cAdvisor's features.


arianvp commented Oct 21, 2017

I'm not very familiar with cAdvisor. Does it just export the cgroup metrics?

Systemd's metrics are oriented around a specific unit file, and they are not limited to cgroups per se. For example, for a .socket unit the number of packets is accounted for, as is the number of times a unit has restarted.

Also, it's orthogonal to containers: systemd unit files don't have much to do with containers at all, I think, so I find it hard to see why this would conflict with cAdvisor.

Also, metrics that have nothing to do with cgroups whatsoever are exported as well, like the timestamp of the latest state change (as requested in #562) and how many times a service restarted.

The systemd collector is not enabled by default, so I wouldn't think it's a problem if it overlaps with the cAdvisor metrics too much. I think there are many people who don't run containers in production but do use the systemd init system, and would like to have metrics about their unit files exported.

```go
),
valueType: prometheus.CounterValue,
},
"AssertTimestampMonotonic": &metric{
```


This and the following metrics still need to be renamed.


Oh yeah, thanks.


SuperQ commented Oct 22, 2017

cAdvisor does export metrics from cgroups, as this is how most of the container systems control and account for process groups. Technically, this means that systemd in this case is a type of container system.

I don't think we want to reproduce the cgroup data here, but having systemd-specific metrics like start/stop/restart counts would be good.


flokli commented Oct 22, 2017

IMHO, the memory and ingress/egress metrics are also interesting in cases where systemd is not used as a container system, since you just want to monitor resource usage of different processes.

Given that those resource metrics are exposed by systemd together with other service-specific metrics and don't introduce any new dependencies, I don't really see an issue in exporting them, without having to install a second container-specific collector ;-)


SuperQ commented Oct 22, 2017

The difficulty is that cgroups ~= containers, and cAdvisor provides these metrics. I don't really like it either, but our policy is not to duplicate separate functionality. Really, from a Prometheus perspective we would want a stand-alone systemd exporter, as it's not exactly a "node"-specific thing, but it is kind of tied 1:1 per node due to the design. This is a fine line, but only due to policy, not for technical reasons.


arianvp commented Oct 22, 2017

The point SuperQ is making is this: because cAdvisor exports just the cgroup hierarchy, and every systemd service has its own cgroup, systemd services will actually just show up in it, as will the systemd slices. So you could use the cAdvisor collector perfectly well to collect systemd service metrics. Actually, the systemd metrics are just a shim over the values that the cgroup exports.

So perhaps it's fair to say cAdvisor is the better tool for such metrics.

Though systemd does support some extra features, like keeping track of network traffic for a service, which cAdvisor does not currently do.

Also, it is really nice that we can already get these metrics with just node_exporter, as we have access to them through dbus anyway - we just discard them currently. It's one runtime dependency less for people, and they get some accounting of resources out of the box. So perhaps that's good enough reason to include systemd metrics.

But I'll leave that up to you to decide. I can always adjust the PR to just have the changes needed for #562 and the number-of-restarts stuff, and leave out the resource metrics.


arianvp commented Oct 22, 2017

Edit: I typed the above message in parallel with SuperQ's response. We seem to have come to similar conclusions.

Shall I adjust this PR to just keep track of the restarts and state changes?

And then send the rest of the patch to the systemd_exporter project instead?
However, that would mean some overlap between systemd_exporter and node_exporter, which again raises the same philosophical issue.


flokli commented Oct 23, 2017

@arianvp I think @SuperQ suggested adding restarts and state changes to node_exporter (this project), and using cAdvisor's Prometheus endpoint to export cgroup-related resource metrics to Prometheus.

I have not used cAdvisor yet; at a quick glance it seems to be much more than just a Prometheus exporter, so I'd be tempted to pull this PR in just to get resource utilization metrics for systemd units.

So moving the systemd restart and state-change stuff into node_exporter, and keeping some lightweight cgroup/systemd resource monitoring in a separate project (although systemd_exporter might be misleading as a name), would be what I'd use, in case it's not possible to merge everything into node_exporter.


SuperQ commented Oct 23, 2017

There has been some talk about building a new cgroups/container metrics exporter that is more lightweight and specific than cAdvisor. Going through systemd/dbus just to get cgroup metrics seems like the wrong idea, and a lot of people seem to have problems with cAdvisor.


flokli commented Oct 23, 2017

So adding just the restart and state changes metrics here seems to be the right way to go for this PR.

@SuperQ Would a collectors/cgroups_linux.go be something that could be integrated into the node_exporter project, or does this also have to be a separate cgroup_exporter due to the above policy?


SuperQ commented Oct 23, 2017

I think a separate exporter for cgroups would probably be better at this time. I was thinking we may want a cgroup library, similar to how we have procfs. We have a lot of scope creep issues with the node_exporter as is.

@discordianfish
Member

@arianvp Want to remove the cgroup stats from this PR and provide only the systemd-specific metrics like start/stop/restart counts, as @SuperQ suggested?


arianvp commented Jan 11, 2018

Yes I will see if I can make some time this weekend to clean up the PR :)


SuperQ commented Jan 31, 2018

Ping, if we can get this cleaned up soon, we can include it in the next release.


arianvp commented Feb 1, 2018

I'll be at FOSDEM this weekend, so I have two days to spend on FOSS :) I'll get this PR ready for release then!


SuperQ commented Feb 1, 2018

Yes, a bunch of us are also at FOSDEM. Looking forward to nice graphs of the conference network.


arianvp commented Feb 2, 2018

Do you have a preferred communication channel for Prometheus? Perhaps we can meet up.


SuperQ commented Feb 2, 2018

We have an IRC/Matrix channel: https://prometheus.io/community/


arianvp commented Feb 3, 2018

Last time I touched this code, the dbus interface was giving uint64 for all these metrics, and now it's suddenly uint32, breaking the code. This took quite some time to debug today :) I'm not sure why dbus is suddenly emitting other types... I don't know very much about it. I can see if I can get it working on 236 again, but I can't guarantee it works for older versions. If the D-Bus interface is unstable like this and differs between systemd versions, I don't feel comfortable maintaining these patches.
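One defensive way to cope with integer properties arriving at different widths, whatever the bus ends up delivering, is a type switch. A sketch with a hypothetical helper, not what this PR actually does:

```go
package main

import "fmt"

// toUint64 normalizes a D-Bus property value that might arrive at
// different integer widths. Defensive sketch, not code from the PR.
func toUint64(v interface{}) (uint64, bool) {
	switch x := v.(type) {
	case uint64:
		return x, true
	case uint32:
		return uint64(x), true
	case int64:
		if x >= 0 {
			return uint64(x), true
		}
	}
	return 0, false
}

func main() {
	v, ok := toUint64(uint32(7))
	fmt.Println(v, ok) // 7 true
}
```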


flokli commented Feb 4, 2018

At least according to src/core/dbus-unit.c (I didn't try it on my own bus), numeric values should have a signature of t, which translates to SD_BUS_TYPE_UINT64, and it looks very much like this hasn't changed recently.


arianvp commented Feb 4, 2018 via email


arianvp commented Feb 5, 2018

It must have been an overdose of Club Mate; I cannot reproduce it anymore, haha. I'll clean up the PR today.

First of all, state changes of individual systemd units are now exposed.

Things like: when was the unit started, when was it stopped, how many
times did it restart before crashing, and how long until a timer activates.

Second, behind an optional flag --collector.systemd.accounting,
additional metrics are defined which expose systemd's view of the cgroup
statistics of a unit. These metrics are also exposed by cAdvisor, so to
avoid duplication they are explicitly opt-in.

arianvp commented Feb 5, 2018

I have updated the PR! I have for now hidden the metrics that conflict with cAdvisor behind a flag collector.systemd.accounting. We can also decide to fully remove them from this PR. I'm fine with that too.

However, I'm running into a merge conflict. Apparently a feature landed in master which partially implements, in a slightly different way, what I implemented here.

@tomwilkie seems to have implemented exporting some timer metrics, but this PR will expose those as well (under systemd's naming conventions): a7fd6b8

We should decide whether we want to revert @tomwilkie's commits in favor of mine.

Also, @tomwilkie did some refactoring to keep only one dbus connection open. That is useful for me too and shouldn't be too hard to adapt to, but first we'll have to decide whether to revert that commit before I can apply the patchset and start refactoring for the single-connection change.


arianvp commented Feb 5, 2018

What's left is:

  • Decide whether to keep resource accounting behind a flag, or throw away that part of the code
  • Add textual descriptions for each metric
  • Normalise units so that we don't use "USec", "MSec", etc.
  • Adapt to Prometheus naming conventions
  • Decide what are Gauges and what are Counters. I simply made everything a Counter for now, but that's probably wrong
  • Decide whether we want the wall-clock or the monotonic-clock versions of metrics like timer_last_triggered. I currently export both, though @tomwilkie's changes seem to export only the wall-clock variant of the last_triggered metric

Hope to get some feedback

@tomwilkie
Member

Sorry @arianvp! I didn't see this PR. Let me know if you need a hand.

I'll be at FOSDEM this weekend

So were a bunch of us - sorry I didn't get to meet you.

under systemd's naming conventions

This means this PR exports usecs instead of seconds - is this desirable? What does the rest of node_exporter do?


arianvp commented Feb 19, 2018

@tomwilkie could you have a look at the above questions I posted? I think we had a race condition because we seem to both have commented at the same time. You might have missed it.


arianvp commented Apr 4, 2018

bump @tomwilkie

@discordianfish
Member

Regarding the time units: it should use seconds; using only base units is generally best practice in Prometheus.
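Concretely, systemd reports these values in microseconds (the *USec properties), so emitting base seconds is a one-line conversion (helper name hypothetical):

```go
package main

import "fmt"

// usecToSeconds converts systemd's microsecond values to the base
// seconds Prometheus conventions expect. Hypothetical helper.
func usecToSeconds(usec uint64) float64 {
	return float64(usec) / 1e6
}

func main() {
	fmt.Println(usecToSeconds(1500000)) // 1.5
}
```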

@discordianfish
Member

Going to close this for now. Feel welcome to re-open once you've updated it.


arianvp commented Sep 3, 2024

I think a separate exporter for cgroups would probably be better at this time. I was thinking we may want a cgroup library, similar to how we have procfs. We have a lot of scope creep issues with the node_exporter as is.

I bit the bullet and started work on this: https://github.com/arianvp/cgroup-exporter
