Support tagfiltertree for fast matching metricIDs to queries #4310

shaan420 · 2024-10-29T18:54:13Z

This PR introduces a new data structure called "tagfiltertree".
A query consists of several tag filters. A tag filter is a tag name and value pair in which the value is a pattern, much like a regex. For instance, "service:foo" is a valid tag filter. A query is a list of such tag filters, all of which must match an incoming metricID (a conjunction).

The key observation behind the tree structure is that checking whether a tag is present in both a metricID and a query can drastically prune the set of candidate queries. We expect that within a couple of comparisons an incoming metricID yields the entire list of matched queries.
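
To illustrate the pruning idea, here is a minimal sketch and not the code in this PR. It assumes a simplified model in which a filter value is either a literal or "*" (any value), and every name in it (`Tree`, `New`, `AddQuery`, `Match`, `node`) is hypothetical. Queries that share filters share a path, and a whole subtree is skipped as soon as the metricID lacks the tag at that level.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// node is one level of the toy tree. Children are keyed first by tag name
// and then by filter value ("*" stands for "any value").
type node struct {
	children map[string]map[string]*node
	queries  []string // IDs of queries whose last filter ends at this node
}

func newNode() *node { return &node{children: map[string]map[string]*node{}} }

// Tree is a hypothetical, simplified stand-in for tagfiltertree.
type Tree struct{ root *node }

// New returns an empty toy tree.
func New() *Tree { return &Tree{root: newNode()} }

// AddQuery inserts a query written as space-separated "name:value" filters.
// Filters are sorted so that queries with common filters can share a path.
func (t *Tree) AddQuery(id, query string) {
	filters := strings.Fields(query)
	sort.Strings(filters)
	cur := t.root
	for _, f := range filters {
		name, val, _ := strings.Cut(f, ":")
		if cur.children[name] == nil {
			cur.children[name] = map[string]*node{}
		}
		if cur.children[name][val] == nil {
			cur.children[name][val] = newNode()
		}
		cur = cur.children[name][val]
	}
	cur.queries = append(cur.queries, id)
}

// Match returns the sorted IDs of all queries whose filters are all satisfied
// by the given tags.
func (t *Tree) Match(tags map[string]string) []string {
	var out []string
	var walk func(n *node)
	walk = func(n *node) {
		out = append(out, n.queries...)
		for name, byVal := range n.children {
			v, ok := tags[name]
			if !ok {
				continue // tag absent: every query below this name is pruned
			}
			if c := byVal[v]; c != nil && v != "*" {
				walk(c) // literal value match
			}
			if c := byVal["*"]; c != nil {
				walk(c) // wildcard filter matches any value
			}
		}
	}
	walk(t.root)
	sort.Strings(out)
	return out
}

func main() {
	tree := New()
	tree.AddQuery("q1", "service:foo env:prod")
	tree.AddQuery("q2", "service:bar")
	tree.AddQuery("q3", "service:foo dc:*")
	tags := map[string]string{"service": "foo", "env": "prod", "dc": "sjc"}
	fmt.Println(tree.Match(tags)) // prints [q1 q3]
}
```

The sketch leaves out regex-like patterns, negation, and variable values, all of which the real package may handle differently.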

This package optimizes CPU and memory at match time rather than during creation of the tree itself. It is not thread-safe, so protect the tree appropriately if it will be used concurrently.
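
One possible way to protect the tree is a thin wrapper around a `sync.RWMutex`, sketched below. The `matcher` interface, `SafeTree`, and `fakeTree` are hypothetical stand-ins, and the method signatures are assumptions, not the package's real API. Taking only a read lock in `Match` also assumes that matching never mutates the tree; if it does, use the write lock there too.

```go
package main

import (
	"fmt"
	"sync"
)

// matcher is a hypothetical view of the tree's API; the real method names
// and signatures in the package may differ.
type matcher interface {
	AddTagFilter(filter, queryID string)
	Match(tags map[string]string) []string
}

// SafeTree serializes writers and lets readers run concurrently.
type SafeTree struct {
	mu   sync.RWMutex
	tree matcher
}

// AddTagFilter takes the write lock because it mutates the tree.
func (s *SafeTree) AddTagFilter(filter, queryID string) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.tree.AddTagFilter(filter, queryID)
}

// Match takes the read lock, assuming matching is read-only.
func (s *SafeTree) Match(tags map[string]string) []string {
	s.mu.RLock()
	defer s.mu.RUnlock()
	return s.tree.Match(tags)
}

// fakeTree is a trivial exact-match stand-in so the example runs on its own.
type fakeTree struct{ byFilter map[string][]string }

func (f *fakeTree) AddTagFilter(filter, queryID string) {
	f.byFilter[filter] = append(f.byFilter[filter], queryID)
}

func (f *fakeTree) Match(tags map[string]string) []string {
	var out []string
	for name, val := range tags {
		out = append(out, f.byFilter[name+":"+val]...)
	}
	return out
}

func main() {
	s := &SafeTree{tree: &fakeTree{byFilter: map[string][]string{}}}
	var wg sync.WaitGroup
	for i := 0; i < 8; i++ {
		wg.Add(1)
		go func(i int) {
			defer wg.Done()
			s.AddTagFilter("service:foo", fmt.Sprintf("q%d", i))
			s.Match(map[string]string{"service": "foo"})
		}(i)
	}
	wg.Wait()
	fmt.Println(len(s.Match(map[string]string{"service": "foo"}))) // prints 8
}
```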

Refer to the README.md for more details.

What this PR does / why we need it:
Use this data structure when you want to quickly match an input metricID to a list of queries.

Does this PR introduce a user-facing and/or backwards incompatible change?:

NONE

Does this PR require updating code package or user-facing documentation?:

NONE

fengcheng1518 · 2024-10-30T03:47:42Z

src/metrics/tagfiltertree/tag_filter_tree.go

+}
+
+// IsVarTagValue returns true if the value is a variable tag value.
+func IsVarTagValue(value string) bool {


add some explanation? e.g. I am not sure why len<4 means false

a Var value type is of the form "{{...}}" so the minimum is 4 chars. Anything shorter and we know it's not a Var value type

nit: add comments to explain what Var Tag means. No reviewer can correlate "VarTagValue" with "...{{...}}..." pattern
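
For readers of this thread, a commented check along the lines described above could look like the sketch below. It is a guess based only on the "{{...}}" form mentioned in the reply, not the function body from the PR, and `isVarTagValue` here is a hypothetical re-creation.

```go
package main

import (
	"fmt"
	"strings"
)

// isVarTagValue reports whether value has the form "{{...}}", i.e. a
// variable placeholder wrapped in double braces.
// Hypothetical re-creation from the review thread, not the PR's code.
func isVarTagValue(value string) bool {
	// "{{" plus "}}" is already four characters, so anything shorter
	// cannot be a variable value. This is a cheap early exit.
	if len(value) < 4 {
		return false
	}
	return strings.HasPrefix(value, "{{") && strings.HasSuffix(value, "}}")
}

func main() {
	for _, v := range []string{"{{name}}", "{{}}", "{{}", "foo"} {
		fmt.Printf("%q -> %v\n", v, isVarTagValue(v))
	}
}
```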

fengcheng1518 · 2024-10-30T03:50:41Z

src/metrics/tagfiltertree/tag_filter_tree.go

+type Tag struct {
+	Name string
+	Val  string
+	Var  string


where is the usage of Var? I don't find any

Var will be used in Namespace Attribution to support capture-group-like functionality, as in regex.

so it will not be used in query matching? nit: add some comments to make the code more readable

fengcheng1518 · 2024-10-30T04:26:41Z

src/metrics/tagfiltertree/tag_filter_tree.go

+	return false
+}
+
+// IsMatchNoneTag returns true if the tag is a match none tag.


IsNegation?

Good point. Done.

fengcheng1518 · 2024-10-30T04:46:41Z

src/metrics/tagfiltertree/tag_filter_tree.go

+		}
+		tagValue, tagNameFound := tags[name]
+		absVal, absValFound := node.AbsoluteValues[tagValue]
+		if tagNameFound && absValFound {


if isMatchAny==false, then within this if condition, you will never return true? is that correct logic?

Ha! good catch! yeah this is a bug. We cannot rely on the returned "matched" bool flag. I'll fix it.


justinjc · 2024-10-30T20:19:47Z

.golangci.yml

@@ -186,7 +186,6 @@ linters:
    - gci
    - goconst
    - gocritic
-    - golint


golint is now deprecated, and mainly it does not detect types correctly when generics are used.

justinjc · 2024-10-30T20:23:17Z

src/metrics/tagfiltertree/README.md

+a set of tag filters. One such use-case is metric attribution to namespaces.
+Iterating through each filter individually and matching them is extremely expensive
+since it has to be done on each incoming metricID. Therefore, this data structure
+pre-compiles a set of tag filters in order to optimize matches against an input metricID.


I don't really understand this paragraph.

"attribution to namespaces" - what does this mean? What is a namespace in this context?

How does this pre-compiled data structure prevent you from having to do matching on each incoming metricID?

Perhaps a diagram or example would help here.

Updated the Readme

justinjc · 2024-10-30T20:30:05Z

src/metrics/tagfiltertree/README.md

+pre-compiles a set of tag filters in order to optimize matches against an input metricID.
+
+## Usage
+First create a trie using New() and then add tagFilters using AddTagFilter().


And then I guess you use Match somehow? A code example here would be useful.

shaan420 added 4 commits October 29, 2024 11:34

Support tagfiltertree for fast matching metricIDs to queries

13cf2c2

lint fixes

4c96204

lint fixes

bde1d75

lint fixes

ee6c313

fengcheng1518 reviewed Oct 30, 2024


shaan420 added 3 commits October 29, 2024 23:40

review comments

b9368bb

fix bug in Match() logic

3423225

remove golint since it is deprecated. It was causing issues with detecting generic types

c65aa85

justinjc reviewed Oct 30, 2024


update README.md

41769a2
