Cross-platform .NET grok implementation as a NuGet package
Install as a library from Nuget:
PM> Install-Package Grok.Net
Install as a PowerShell module from PowershellGallery:
Install-Module -Name Grok
Since v.2.0.0, the grok uses the PCRE.NET library for regex.
Grok is a great way to parse unstructured log data into something structured and queryable. It sits on top of Regular Expression (regex) and uses text patterns to match lines in log files.
A great way to get started with building your grok filters is this grok debug tool: https://grokdebugger.com
What can I use Grok for?
- reporting errors and other patterns from logs and processes
- parsing complex text output and converting it to JSON for external processing
- apply 'write-once use-everywhere' to regular expressions
- automatically providing patterns for unknown text inputs (logs you want patterns generated for future matching)
The syntax for a grok pattern is %{SYNTAX:SEMANTIC}
The SYNTAX
is the name of the pattern that will match your text. SEMANTIC
is the key.
For example, 3.44
will be matched by the NUMBER
pattern, and 55.3.244.1
will be matched by the IP
pattern. 3.44
could be the duration of an event, so you could call it simply duration
. Further, a string 55.3.244.1
might identify the client
making a request.
For the above example, your grok filter would look something like this:
%{NUMBER:duration} %{IP:client}
Examples: With that idea of syntax and semantics, we can pull out useful fields from a sample log like this fictional HTTP request log:
55.3.244.1 GET /index.html 15824 0.043
The pattern for this could be:
%{IP:client} %{WORD:method} %{URIPATHPARAM:request} %{NUMBER:bytes} %{NUMBER:duration}
More about grok
Create a new instance with grok pattern:
Grok grok = new Grok("%{MONTHDAY:month}-%{MONTHDAY:day}-%{MONTHDAY:year} %{TIME:timestamp};%{WORD:id};%{LOGLEVEL:loglevel};%{WORD:func};%{GREEDYDATA:msg}");
then prepare some logs to parse
string logs = @"06-21-19 21:00:13:589241;15;INFO;main;DECODED: 775233900043 DECODED BY: 18500738 DISTANCE: 1.5165
06-22-19 22:00:13:589265;156;WARN;main;DECODED: 775233900043 EMPTY DISTANCE: --------";
You are ready to parse and print the result
var grokResult = grok.Parse(logs);
foreach (var item in grokResult)
{
Console.WriteLine($"{item.Key} : {item.Value}");
}
output:
month : 06
day : 21
year : 19
timestamp : 21:00:13:589241
id : 15
loglevel : INFO
func : main
msg : DECODED: 775233900043 DECODED BY: 18500738 DISTANCE: 1.5165
month : 06
day : 22
year : 19
timestamp : 22:00:13:589265
id : 156
loglevel : WARN
func : main
msg : DECODED: 775233900043 EMPTY DISTANCE: --------
or use ToDictionary()
on grokResult
to get the result as IReadOnlyDictionary<string, IEnumerable<object>>
There is the possibility to add your own patterns.
Create a file and write the pattern you need as the pattern name, space, and then the regexp for that pattern.
For example, Patterns\grok-custom-patterns:
ZIPCODE [1-9]{1}[0-9]{2}\s{0,1}[0-9]{3}
then load the file and pass the stream to Grok:
FileStream customPatterns = System.IO.File.OpenRead(@"Patterns\grok-custom-patterns");
Grok grok = new Grok("%{ZIPCODE:zipcode}:%{EMAILADDRESS:email}", customPatterns);
var grokResult = grok.Parse($"122001:Bob.Davis@microsoft.com");
Define a collection of patterns
var custom = new Dictionary<string, string>
{
{"BASE64", "(?=(.{4})*$)[A-Za-z0-9+/]*={0,2}$"}
};
and use it as follows
var grok = new Grok("Basic %{BASE64:credentials}", custom);
GrokResult grokResult = grok.Parse("Basic YWRtaW46cGEkJHdvcmQ=");
Install and use the Grok as a PowerShell module
grok -i "06-21-19 21:00:13:589241;15;INFO;main;DECODED: 775233900043 DECODED BY: 18500738 DISTANCE: 1.5165" -g "%{MONTHDAY:month}-%{MONTHDAY:day}-%{MONTHDAY:year} %{TIME:timestamp};%{WORD:id};%{LOGLEVEL:loglevel};%{WORD:func};%{GREEDYDATA:msg}"
To get help use help grok
command
On Windows:
build.ps1
On Linux/Mac:
build.sh
Would you like to help make grok.net even better? We keep a list of issues that are approachable for newcomers under the good-first-issue label.
Also. please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.
This project is licensed under the MIT License - see the LICENSE.md file for details
Thanks to @martinjt. The project is based on martinjt/grokdotnet.