add journalparser plugin #2569

phemmer · 2017-03-24T14:29:56Z

Adds a journalparser plugin. The plugin's configuration tries to be as similar to that of the logparser plugin as possible, and it shares the grok parser code.

This plugin does have the unfortunate result of breaking cross-compilation, because it uses CGO for the systemd journal libs. You can get cross-compilation working again, but you need a cross-compilation toolchain on the build host.

Required for all PRs:

CHANGELOG.md updated (we recommend not updating this until the PR has been approved by a maintainer)
Sign CLA (if not already signed)
README.md updated (if adding a new plugin)

Closes #2109

phemmer · 2017-03-24T14:33:28Z

plugins/inputs/logparser/grok/grok.go

@@ -281,7 +278,7 @@ func (p *Parser) ParseLine(line string) (telegraf.Metric, error) {
 		}
 	}

-	return metric.New(p.Measurement, tags, fields, p.tsModder.tsMod(timestamp))


This line is the reason for all the grok and logparser changes.
metric.New() returns an error if there are no fields present. In the journalparser plugin this can happen because the user may have configured the plugin to parse multiple journal fields. The plugin calls ParseLine for each journal field and then merges all the results together, so any one of the calls might not extract any fields.
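
The merge step described here can be sketched as follows. This is a minimal illustration rather than the PR's code: parseResult, mergeResults, and errNoFields are hypothetical names. The only behavior taken from the comment is that an individual ParseLine call may yield no fields, while the merged metric still needs at least one.

```go
package main

import (
	"errors"
	"fmt"
)

// parseResult is a hypothetical stand-in for what one ParseLine call extracts.
type parseResult struct {
	Tags   map[string]string
	Fields map[string]interface{}
}

// errNoFields mirrors the failure mode described above: a metric needs at
// least one field.
var errNoFields = errors.New("no fields extracted from any journal field")

// mergeResults combines the per-journal-field results into one tag set and
// one field set. Individual results may be empty; only the merged total must
// contain at least one field.
func mergeResults(results []parseResult) (map[string]string, map[string]interface{}, error) {
	tags := map[string]string{}
	fields := map[string]interface{}{}
	for _, r := range results {
		for k, v := range r.Tags {
			tags[k] = v
		}
		for k, v := range r.Fields {
			fields[k] = v
		}
	}
	if len(fields) == 0 {
		return nil, nil, errNoFields
	}
	return tags, fields, nil
}

func main() {
	results := []parseResult{
		{Tags: map[string]string{"unit": "httpd.service"}}, // this call found tags only
		{Fields: map[string]interface{}{"status": 200}},    // this call found a field
	}
	tags, fields, err := mergeResults(results)
	fmt.Println(tags, fields, err)
}
```

Deferring the "at least one field" check until after the merge is what lets a single empty result pass through without failing the whole journal entry.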

phemmer · 2017-03-24T14:39:41Z

Looks like the build is failing because it needs the systemd development headers. I'm not familiar with CircleCI or how to make sure these are installed.

danielnelson · 2017-03-24T20:45:10Z

AFAIK, in the past we haven't accepted plugins that require C libraries. Perhaps we can add this to the extras plugin repo once it is ready.

phemmer · 2017-03-24T21:13:57Z

If the requirement is just not having external libraries so that users don't have to install them to compile telegraf, what about including the header file in the telegraf repo? Since the libs are dynamically loaded, they don't need to be present when building, or even running (as long as you don't try to use a plugin which requires them).

danielnelson · 2017-03-24T23:21:48Z

I don't want to copy in code from other projects, and it really ought to use the machine's systemd headers. I wonder if there is a way we can make this plugin optional without making our build system totally nonstandard. In my mind the ideal setup would be that if you have the header then it builds, and otherwise it doesn't.

phemmer · 2017-03-25T01:27:40Z

Well, to play devil's advocate, copying code is kinda what the Go vendoring thing is about.
Systemd also offers an API stability promise that they won't break things: https://www.freedesktop.org/wiki/Software/systemd/InterfacePortabilityAndStabilityChart/ (this PR uses the "sd-journal.h API"). So using an out-of-date header file should cause no harm.

An alternative would be to do this: https://godoc.org/github.com/rainycape/dl#example-Open--Snprintf. We wouldn't need the header files at all this way.

As for a way to make the plugin optional: you can accomplish anything with go generate. You could set it up so that go generate enables the plugin for building if the headers are found. You could also use build flags, such that the plugin is on or off by default, with a build flag to flip it the other way.
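
A rough sketch of the build-flag variant, under stated assumptions: the registry, register, and enabledInputs names are hypothetical and are not Telegraf's actual plugin registration API. In a real tree the optional plugin would sit in its own file behind a build constraint. This single-file version always registers both plugins so that it stays runnable.

```go
package main

import (
	"fmt"
	"sort"
)

// registry maps input plugin names to constructors. All names here are
// hypothetical.
var registry = map[string]func() string{}

// register is what each plugin file would call from its init function.
func register(name string, ctor func() string) {
	registry[name] = ctor
}

// In a real tree this init would live in its own file whose first line is a
// build constraint such as
//
//	//go:build journal
//
// (written as "// +build journal" for toolchains older than Go 1.17), so the
// file and its cgo dependency are compiled only with `go build -tags journal`.
func init() {
	register("journalparser", func() string { return "journalparser (optional)" })
}

// Always-on plugins register without any constraint.
func init() {
	register("logparser", func() string { return "logparser" })
}

// enabledInputs returns the sorted names of the plugins compiled into this
// binary.
func enabledInputs() []string {
	names := make([]string, 0, len(registry))
	for name := range registry {
		names = append(names, name)
	}
	sort.Strings(names)
	return names
}

func main() {
	fmt.Println(enabledInputs())
}
```

Because registration happens in init, a plugin excluded by its build constraint simply never appears in the registry, and the rest of the program needs no conditional code.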

nhaugo · 2017-03-31T22:00:53Z

1.3 will utilize modules from Go 1.8. We will have to wait until then to get this in, due to the dependency on external C libs.

phemmer · 2017-04-30T19:51:10Z

@nhaugo I thought that was on hold. #2373 (comment)

Anyway, I just pushed an alternate implementation for consideration. It uses the journalctl command and parses the output. I didn't do this initially as I'm not fond of spawning external processes. However the implementation I put in keeps this to a minimum (1 process in all but the rarest of use cases). It also lets the external journalctl process do the heavy filtering, instead of doing it in telegraf.

So in a more descriptive nutshell, what it does:

Forks off a single journalctl process with a disjunction (OR list) of all the match sets. Meaning if one instance of [[inputs.journalparser]] has matches = ["_COMM=httpd","PRIORITY=6"] and another instance has ["_COMM=nginx","PRIORITY=6"], then journalctl will look for (_COMM=httpd PRIORITY=6) OR (_COMM=nginx PRIORITY=6).
Every time the match list is updated, a new journalctl with updated args is spawned, and we kill off the old one.
Since we only have a single stream for all journal events, we then have to filter again to send them to the right journalparser instance, but this shouldn't be that bad since journalctl already has done the heavy filtering for us.
Also because of this, each journal entry is only parsed once. In the previous implementation, if the same journal entry was matched by multiple journalparser instances, we'd parse it multiple times. The previous implementation was also opening up the journal for every journalparser instance, so even though filtering was done in the C code, it was still filtering the same data multiple times.
The rare use case where multiple journalctl processes are spawned is if the path property is changed. We launch one journalctl process per path value.
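
The two filtering stages above can be sketched roughly as follows. This is not the code from the PR: buildArgs and entryMatches are hypothetical names, and the --follow and --output=json flags are an assumption about how the stream would be consumed. The "+" separator is journalctl's own syntax for ORing groups of matches. Entries are modelled as flat string maps, which simplifies journalctl's JSON output.

```go
package main

import (
	"fmt"
	"strings"
)

// buildArgs joins match sets with journalctl's "+" separator, which ORs the
// groups on either side of it. Empty match sets are not handled specially.
func buildArgs(matchSets [][]string) []string {
	args := []string{"--follow", "--output=json"}
	for i, set := range matchSets {
		if i > 0 {
			args = append(args, "+")
		}
		args = append(args, set...)
	}
	return args
}

// entryMatches reports whether a decoded journal entry satisfies one match
// set. Matches on different fields are ANDed and matches on the same field
// are ORed, following journalctl's documented semantics.
func entryMatches(entry map[string]string, set []string) bool {
	allowed := map[string][]string{}
	for _, m := range set {
		kv := strings.SplitN(m, "=", 2)
		if len(kv) != 2 {
			return false // malformed match expression
		}
		allowed[kv[0]] = append(allowed[kv[0]], kv[1])
	}
	for field, values := range allowed {
		got, present := entry[field]
		ok := false
		for _, v := range values {
			if present && got == v {
				ok = true
				break
			}
		}
		if !ok {
			return false
		}
	}
	return true
}

func main() {
	sets := [][]string{
		{"_COMM=httpd", "PRIORITY=6"},
		{"_COMM=nginx", "PRIORITY=6"},
	}
	fmt.Println("journalctl", strings.Join(buildArgs(sets), " "))

	// Second-stage routing: decide which instance receives this entry.
	entry := map[string]string{"_COMM": "nginx", "PRIORITY": "6"}
	for i, set := range sets {
		fmt.Printf("instance %d matches: %v\n", i, entryMatches(entry, set))
	}
}
```

The second stage only has to examine entries that already passed journalctl's filter, which is why the comment expects its cost to be small.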

This has barely been tested (though it does have a test suite which is passing). And I think I'd like to clean up the unit tests a bit. But it should be functional.

danielnelson · 2017-05-18T02:01:26Z

I would be okay with the version requiring the headers so long as they are not needed by default, and we add something to the build to make sure we build support for it. Maybe we just add a --list-inputs style option so we can check that everything was enabled.

Automatically enabling features would be cool but I don't want to roll my own solution for this. Plugin thing is paused, probably will try to do a gRPC solution.

The subprocess version sounds fine as well. Which implementation do you prefer?

phemmer · 2017-05-20T04:21:33Z

I'm not sure which I prefer.
External process is nice in that you don't have compile time build features you have to enable (though I would argue that this is really an artificial limitation of the coreos/go-systemd implementation).
But on the other hand not relying on external utilities is nice. And the code for using journalctl is significantly more complex.

DSpeichert · 2019-08-25T23:22:28Z

@phemmer Any chance of resurrecting this? I'd love to have this input merged, and I can help if needed.

danielnelson · 2019-08-28T03:13:32Z

I think we should probably close this issue out for now. We haven't gotten any closer to solving the issue of optional C library dependencies since this issue was last updated, and no real tooling has emerged that could help. I would still be willing to accept a native Go solution; however, I'm not sure journald provides an API outside of the C library.

It is also now possible to forward syslog messages into Telegraf in a performant and reliable way, and I think this is a very good method for handling log messages when using journald.

phemmer · 2019-08-28T03:18:42Z

Just FYI, the code that was up was a Go-native solution. However, I no longer use telegraf, so closing is the right action. But it could be revived by someone else.


danielnelson · 2019-08-28T03:36:15Z

Thanks for the clarification. That's right: we had a CGO implementation, but this is a fork/exec version of it.

@DSpeichert I would still recommend trying the syslog plugin first to see if it will work for you, but if you are still interested in finishing up this plugin, let me know. There have been some changes in Telegraf that would affect how we should go about it.

gdamjan · 2019-08-28T12:32:04Z

syslog transport would lose the structured data that's in the journal

DSpeichert · 2019-08-28T13:26:58Z

@danielnelson I explored the option of forwarding the journald log to telegraf via the syslog protocol.
I'm sorry for hijacking this issue into a question about that feature, but based on journald documentation:

With the first method, messages are immediately forwarded to a socket (/run/systemd/journal/syslog), where the traditional syslog daemon can read them.

I don't think that the telegraf plugin supports that, it seems to list support for TCP, UDP and TLS only.

It isn't any harder in Go to listen on a Unix socket, but I haven't tried this and I'm not sure if journald is creating that socket or expecting e.g. telegraf to create that. It also requires telegraf to start early in the boot process and immediately read everything or risk losing it, vs a tailing approach that can catch up for a longer time.
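
For reference, listening on a Unix datagram socket in Go could look roughly like the sketch below. It assumes the reader creates the socket itself, which is one of the two possibilities left open above, and it binds to a temporary path rather than /run/systemd/journal/syslog. The names listenSyslogSocket and roundTrip are hypothetical, and the payload is an arbitrary sample line.

```go
package main

import (
	"fmt"
	"net"
	"os"
	"path/filepath"
)

// listenSyslogSocket binds a Unix datagram socket at path. Binding creates
// the socket file, so this covers only the case where the reader owns it.
func listenSyslogSocket(path string) (*net.UnixConn, error) {
	return net.ListenUnixgram("unixgram", &net.UnixAddr{Name: path, Net: "unixgram"})
}

// roundTrip sends one datagram to a freshly bound socket in a temporary
// directory and returns what the listener read.
func roundTrip(msg string) (string, error) {
	dir, err := os.MkdirTemp("", "syslog-sketch")
	if err != nil {
		return "", err
	}
	defer os.RemoveAll(dir)

	path := filepath.Join(dir, "syslog.sock")
	conn, err := listenSyslogSocket(path)
	if err != nil {
		return "", err
	}
	defer conn.Close()

	// Stand-in for the forwarding side: an unbound datagram client.
	client, err := net.DialUnix("unixgram", nil, &net.UnixAddr{Name: path, Net: "unixgram"})
	if err != nil {
		return "", err
	}
	defer client.Close()
	if _, err := client.Write([]byte(msg)); err != nil {
		return "", err
	}

	// Each read returns one whole datagram, i.e. one message.
	buf := make([]byte, 8192)
	n, _, err := conn.ReadFromUnix(buf)
	if err != nil {
		return "", err
	}
	return string(buf[:n]), nil
}

func main() {
	got, err := roundTrip("<14>Aug 28 13:26:58 host app: hello")
	fmt.Println(got, err)
}
```

A datagram socket only buffers messages while it is bound and being read, so this sketch does nothing to address the concern about starting early enough in the boot process.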

glinton · 2019-08-28T14:24:27Z

it seems to list support for TCP, UDP and TLS only

That must have been a documentation oversight, as unix sockets are supported by the syslog input.

danielnelson · 2019-08-28T19:10:56Z

I'll update the documentation for that. Right now the chain needs to be journald -> rsyslog -> syslog input. In order to take rsyslog out of the pipeline we would need support for RFC 3164 messages (#4593).

syslog transport would lose the structured data that's in the journal

This is true. You could try using imjournal in rsyslog to bring this over. If we wanted to take rsyslog out and have structured data, then I believe we are essentially back to this PR.

dokterbob · 2021-11-04T12:27:30Z

Hey, why is this closed?

dokterbob · 2021-11-04T12:28:56Z

In other words; what is required to get this code in?


phemmer mentioned this pull request Mar 24, 2017

[Feature request] add input plugin to tail systemd journal #2109

Closed

nhaugo added this to the Future Milestone milestone Mar 31, 2017

add journalparser plugin

fcbe8a1

danielnelson removed this from the Future Milestone milestone Jun 14, 2017

danielnelson added the feat Improvement on an existing feature such as adding a new setting/mode to an existing plugin label Aug 24, 2017

russorat added the new plugin label Jan 20, 2018

danielnelson closed this Aug 28, 2019
