Monitor batch write #22
Conversation
Force-pushed from e85fca2 to 895f8da
Force-pushed from 895f8da to 3bebb6a
Generally fine, but a few problems need to be handled:
- As we discussed, we won't merge Fix parse monitor URL #21, so this PR shouldn't be based on the changes from Fix parse monitor URL #21.
- We need a way to flush the data in the queue before the goroutine is cancelled; currently, the queued data is lost once the goroutine is cancelled.
(I don't know how to do this yet; maybe Go has a way for a goroutine to know it is being cancelled.)
Maybe we can do some cleanup work when we receive the SIGTERM signal; this needs some research. A simple workaround: wait more than 1m (or set a smaller batch interval) before closing the pod.
@qi-zhou Please confirm whether K8s allows this waiting time.
I suspect using a shared array and a mutex could cause "starvation" when you have high traffic.
Only the goroutines from the HTTP requests may acquire the lock, and the goroutine to write the batched requests may never get the lock.
Remember the Go "proverb":
Do not communicate by sharing memory; instead, share memory by communicating.
(https://blog.golang.org/share-memory-by-communicating)
- Create an input channel for events, instead of the shared array + lock.
- Each request writes an event to the input channel.
- The batching goroutine reads events from the channel and adds them to an array.
- Either after the timeout or after you reach some large batch size, you can send the batch of points to InfluxDB.
Benefits:
- You don't need a shared array, because the buffer is private to the batching goroutine.
- Because you don't have a shared array, you don't need the mutex.
- It will be much easier to retry when sending data to InfluxDB fails for a 5xx reason.
- You can control the timeout and the size of the buffer that you send to InfluxDB. Here's a reference
Oops. Review submitted early by accident, sorry!
What I was going to say was: Here's an (older) reference that says 5-10k events per request is a good amount: https://community.influxdata.com/t/what-is-the-highest-performance-method-of-getting-data-in-out-of-influxdb/464
move to #25
solve #13