Implement queue to forward traces to metrics-generator #1331

mapno · 2022-03-08T14:46:30Z

What this PR does:

The distributor uses a forwarder to queue and forward requests to the metrics generators.

This queue will drop the oldest item in the queue if it's full and a new request is pushed.

Which issue(s) this PR fixes:
Contributes to #1303

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

joe-elliott

Nice work. I feel strongly that a go channel approach would be preferred. Let's discuss.

Edit:
Just thought of another question. How do we make sure these queues are drained before during shutdown?

modules/distributor/distributor.go

modules/distributor/forwarder.go

joe-elliott

A few thoughts. I'm concerned about two things

Each queue getting its own set of workers. It would be far simpler for all queues to share the same worker pool.
Managing the overrides in the queueManager. It feels like there's a simpler pattern where the forwarder sees that an update occurred, creates a new queueManager for the tenant and adds it to the map with the new configuration, and then just tells the existing queueManager to drain itself.

modules/distributor/distributor.go

modules/distributor/forwarder.go

yvrhdn

I'd be very interested to see how this performs in a bigger cluster! This will reduce the concurrency with which we process requests, so I'm curious to see what amount of workers works well.

modules/distributor/distributor.go

modules/overrides/overrides.go

modules/distributor/forwarder.go

modules/distributor/forwarder_test.go

yvrhdn

Left some comments about how we manage the queueManager's, I think we can improve locking a bit so writes don't block each other.

modules/distributor/forwarder.go

yvrhdn

Okay, I've left some more comments. Thanks for the replies, the structure is making more sense now 🙂

I think these are the essential metrics for monitoring monitoring distributor -> metrics-generator traffic:

tempo_distributor_forwarder_queue_length
tempo_distributor_forwarder_pushes
tempo_distributor_forwarder_dropped_pushes
tempo_distributor_metrics_generator_pushes_total
tempo_distributor_metrics_generator_pushes_failures_total

Should we rename tempo_distributor_forwarder_dropped_pushes to tempo_distributor_forwarder_pushes_failures_total?

modules/overrides/limits.go

modules/distributor/forwarder.go

Use eviciting queue to buffer and send push requests from the distributor to the generator. This queue will drop the oldest item in the queue if it's full and a new request is pushed.

yvrhdn

Alright, this is looking good. The experiments in our internal environments also showed good results :)
I'm ready to approve and merge this PR, just left a comment about the metrics we are recording.

modules/distributor/forwarder.go

modules/overrides/limits.go

mapno force-pushed the distributor-generator-queue branch 4 times, most recently from 00b61b7 to 7848ea9 Compare March 15, 2022 10:04

mapno marked this pull request as ready for review March 15, 2022 11:39

mapno requested review from joe-elliott, annanay25, mdisibio, dgzlopes, yvrhdn and zalegrala as code owners March 15, 2022 11:39

joe-elliott reviewed Mar 16, 2022

View reviewed changes

joe-elliott reviewed Mar 17, 2022

View reviewed changes

modules/distributor/distributor.go Outdated Show resolved Hide resolved

modules/distributor/forwarder.go Outdated Show resolved Hide resolved

yvrhdn reviewed Mar 23, 2022

View reviewed changes

mapno force-pushed the distributor-generator-queue branch 2 times, most recently from 4f85511 to e3afa0b Compare March 30, 2022 14:43

yvrhdn reviewed Apr 1, 2022

View reviewed changes

mapno force-pushed the distributor-generator-queue branch from c05c36a to df8c71d Compare April 5, 2022 16:17

yvrhdn reviewed Apr 12, 2022

View reviewed changes

mapno added 11 commits April 18, 2022 09:35

Implement queue to forward traces to metrics-generator

72c4e81

Use eviciting queue to buffer and send push requests from the distributor to the generator. This queue will drop the oldest item in the queue if it's full and a new request is pushed.

Format

e829962

Evicting queue improvements and fixes

d970c16

Change implementation away from subscriptions

0484737

Implement circular queue

19087a9

Lint

8e3464c

Add forwarder

1ec5692

Changelog entry

1fb0ea0

ShutdownCh unused

8c3ec52

Add TODOs

8207c61

Add overwrite function to queue

53f9202

mapno added 12 commits April 18, 2022 09:35

Add overwrites metric

e50cf9b

Switch to channels approach

4755487

Refactor to address comments

9541178

minor fix

42c2152

lint

8cd5f6c

Address last comments

66e1d40

Add metric for queue length

be784b3

Address comments

847e998

Use RWMutex

0ee0810

Record metrics synchronously

ae8e952

Add queueManager default config

9a817aa

Rename forwarder metric

ffd1033

mapno force-pushed the distributor-generator-queue branch from df8c71d to ffd1033 Compare April 18, 2022 08:40

mapno added 2 commits April 18, 2022 10:45

Remove unused method

f41ba09

Fix metric name

d1c7615

yvrhdn reviewed Apr 22, 2022

View reviewed changes

modules/distributor/forwarder.go Outdated Show resolved Hide resolved

modules/overrides/limits.go Outdated Show resolved Hide resolved

Address last comments

c50db83

yvrhdn approved these changes Apr 22, 2022

View reviewed changes

mapno merged commit f2406df into grafana:main Apr 22, 2022

mapno deleted the distributor-generator-queue branch April 22, 2022 14:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement queue to forward traces to metrics-generator #1331

Implement queue to forward traces to metrics-generator #1331

mapno commented Mar 8, 2022 •

edited

Loading

joe-elliott left a comment •

edited

Loading

joe-elliott left a comment

yvrhdn left a comment

yvrhdn left a comment

yvrhdn left a comment

yvrhdn left a comment

Implement queue to forward traces to metrics-generator #1331

Implement queue to forward traces to metrics-generator #1331

Conversation

mapno commented Mar 8, 2022 • edited Loading

joe-elliott left a comment • edited Loading

Choose a reason for hiding this comment

joe-elliott left a comment

Choose a reason for hiding this comment

yvrhdn left a comment

Choose a reason for hiding this comment

yvrhdn left a comment

Choose a reason for hiding this comment

yvrhdn left a comment

Choose a reason for hiding this comment

yvrhdn left a comment

Choose a reason for hiding this comment

mapno commented Mar 8, 2022 •

edited

Loading

joe-elliott left a comment •

edited

Loading