
query: fix querying with interleaved data #5035

Merged: 2 commits into thanos-io:main from attach_metrics_name on Jan 10, 2022

Conversation

@GiedriusS (Member) commented on Jan 6, 2022

Fix querying when some data has been pushed down and some hasn't.
Since `max_over_time` and `min_over_time` remove `__name__` from the
results either way, let's do that inside `Select()` to have a unified form
of data.

Add a test to cover this case.

Without this code, the test fails:

```
level=error ts=2022-01-06T16:28:37.959956685Z msg="function failed. Retrying in next tick" err="read query instant response: expected 2xx response, got 422. Body: {\"status\":\"error\",\"errorType\":\"execution\",\"error\":\"vector cannot contain metrics with the same labelset\"}\n"
```

Closes #5033.
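For illustration, a minimal Go sketch (not the actual Thanos change; the helper name is made up) of the normalization described above: strip `__name__` from a labelset so pushed-down and regular results share one form.

```go
package main

import (
	"fmt"

	"github.com/prometheus/prometheus/model/labels"
)

// dropMetricName is a hypothetical helper, not the Thanos code: it
// removes __name__ so series coming from pushed-down and regular
// paths look the same to the query engine.
func dropMetricName(lset labels.Labels) labels.Labels {
	// labels.MetricName is the "__name__" label key.
	return labels.NewBuilder(lset).Del(labels.MetricName).Labels()
}

func main() {
	lset := labels.FromStrings("__name__", "my_metric", "job", "p1")
	fmt.Println(dropMetricName(lset)) // {job="p1"}
}
```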

@GiedriusS (Member, Author):

cc @fpetkovski


Signed-off-by: Giedrius Statkevičius <giedrius.statkevicius@vinted.com>
@GiedriusS force-pushed the attach_metrics_name branch from 62bef32 to fac4e88 on January 6, 2022, 16:41
fpetkovski previously approved these changes on Jan 7, 2022
```go
testutil.Ok(t, err)
t.Cleanup(e2ethanos.CleanScenario(t, e))

prom1, sidecar1, err := e2ethanos.NewPrometheusWithSidecar(e, "p1", defaultPromConfig("p1", 0, "", ""), "", e2ethanos.DefaultPrometheusImage(), "remote-write-receiver")
```
Contributor:

Is it intended to enable the remote-write receiver?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The test synthesizes samples by remote writing them, which is why the receiver feature is needed.
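For context, a rough sketch (hypothetical metric name and helper; not the actual e2ethanos code) of what synthesizing samples via remote write involves: marshal a prompb.WriteRequest, snappy-compress the protobuf, and POST it to the receiver's /api/v1/write endpoint.

```go
package main

import (
	"bytes"
	"net/http"
	"time"

	"github.com/gogo/protobuf/proto"
	"github.com/golang/snappy"
	"github.com/prometheus/prometheus/prompb"
)

// pushSample is a hypothetical helper: it remote-writes a single
// sample for a made-up metric to a remote-write receiver.
func pushSample(baseURL string, value float64) error {
	wr := &prompb.WriteRequest{
		Timeseries: []prompb.TimeSeries{{
			Labels: []prompb.Label{
				{Name: "__name__", Value: "my_fake_metric"}, // hypothetical metric
				{Name: "prometheus", Value: "p1"},
			},
			Samples: []prompb.Sample{{
				Timestamp: time.Now().UnixMilli(),
				Value:     value,
			}},
		}},
	}
	raw, err := proto.Marshal(wr)
	if err != nil {
		return err
	}
	// Remote write sends snappy-compressed protobuf.
	req, err := http.NewRequest(http.MethodPost, baseURL+"/api/v1/write",
		bytes.NewReader(snappy.Encode(nil, raw)))
	if err != nil {
		return err
	}
	req.Header.Set("Content-Type", "application/x-protobuf")
	req.Header.Set("Content-Encoding", "snappy")
	req.Header.Set("X-Prometheus-Remote-Write-Version", "0.1.0")
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return err
	}
	return resp.Body.Close()
}
```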

Contributor:

Thanks @fpetkovski. I thought it was using data from the block only. Didn't notice the remote write part.

test/e2e/query_test.go (thread resolved)
Signed-off-by: Giedrius Statkevičius <giedrius.statkevicius@vinted.com>
@yeya24 (Contributor) left a comment:

LGTM

```go
// Delete the metric's name from the result because that's what the
// PromQL does either way and we want our iterator to work with data
// that was either pushed down or not.
if q.enableQueryPushdown && (hints.Func == "max_over_time" || hints.Func == "min_over_time") {
```
Contributor:

Just curious why PromQL does this. Isn't it a bug?

@GiedriusS (Member, Author):

Nope, it is a feature. Think of these functions as calculating the maximum over all input, not just for each unique labelset. I think this is where it happens: https://github.com/prometheus/prometheus/blob/d677aa4b29a8a0cf9f61af04bbf5bfdce893cf23/promql/engine.go#L1304-L1310.
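To make the failure mode concrete, here is an illustrative sketch (hypothetical metric name, not Thanos code): once PromQL drops the metric name, a pushed-down series that already lost `__name__` and a regular series collapse into the same labelset, which is exactly the "vector cannot contain metrics with the same labelset" situation.

```go
package main

import (
	"fmt"

	"github.com/prometheus/prometheus/model/labels"
)

func main() {
	// Pushed-down result: the store already stripped __name__.
	pushedDown := labels.FromStrings("job", "p1")
	// Regular result: the name is still attached.
	regular := labels.FromStrings("__name__", "my_metric", "job", "p1")

	// PromQL drops the metric name for max_over_time/min_over_time results.
	dropped := labels.NewBuilder(regular).Del(labels.MetricName).Labels()

	// true: two series with an identical labelset in one vector.
	fmt.Println(labels.Equal(pushedDown, dropped))
}
```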

@GiedriusS (Member, Author):

Once more functions are pushed down, we will need to improve the logic here. We will probably need to refactor this into a separate struct; I don't know yet.

@GiedriusS GiedriusS merged commit 9a8c984 into thanos-io:main Jan 10, 2022
@GiedriusS GiedriusS deleted the attach_metrics_name branch January 10, 2022 07:48
Successfully merging this pull request may close these issues:

Pushdown: fix querying when proxied metrics have a mixture of pushed down and not pushed down data