
perf(storagenode): introduce accumulator #415

Open
ijsong opened this issue Apr 11, 2023 · 0 comments

Motivation

A client writes log entries by calling the Append RPC. A single Append request can contain a batch of log entries, writing many logs at once and reducing network round trips. Batching log entries also makes the log-appending pipeline in a storage node efficient, since the storage layer stores a set of log entries as a single batch rather than one by one.

                                            +-----------+
                                            |           |
                                       +--->| committer |
                                       |    |           |
                                       |    +-----------+
+---------------+                      |                 
| AppendRequest |      +-----------+   |    +-----------+
|   +-+-+-+-+   |      |           |   |    |           |
|   |1|2|3|4|---+----->| sequencer |---+--->|  writer   |
|   +-+-+-+-+   |      |           |   |    |           |
+---------------+      +-----------+   |    +-----------+
                                       |                 
                                       |    +-----------+
                                       |    |           |
                                       +--->|replicator |
                                            |           |
                                            +-----------+

This works well when a client writes many log entries in a single request. For example, the append request shown in the figure above writes four logs at once.

However, when an append request carries only a few log entries, for instance, a single log entry per request, the storage node cannot benefit from the batch write of the storage layer. In the current implementation, the storage node feeds only the log entries of a single request into the log-appending pipeline.

We introduce the accumulator to collect log entries in front of the sequencer and to form a batch.

+---------------+                                                           
| AppendRequest |                                                           
|   +-+         |                                                           
|   |1|         |--+                                           +-----------+
|   +-+         |  |                                           |           |
+---------------+  |                                        +->| committer |
+---------------+  |                                        |  |           |
| AppendRequest |  |                                        |  +-----------+
|   +-+         |  |                +-+-+-+-+               |               
|   |2|         |--+  +-----------+ |1|2|3|4| +-----------+ |  +-----------+
|   +-+         |  |  |           | +-+-+-+-+ |           | |  |           |
+---------------+  +->|accumulator|---------->| sequencer |-+->|  writer   |
+---------------+  |  |           |           |           | |  |           |
| AppendRequest |  |  +-----------+           +-----------+ |  +-----------+
|   +-+         |  |                                        |               
|   |3|         |--+                                        |  +-----------+
|   +-+         |  |                                        |  |           |
+---------------+  |                                        +->|replicator |
+---------------+  |                                           |           |
| AppendRequest |  |                                           +-----------+
|   +-+         |  |                                                        
|   |4|         |--+                                                        
|   +-+         |                                                           
+---------------+                                                           

Design

A dedicated goroutine executes the accumulator. It has a queue through which it receives log entries from the Append RPC handlers.

The back-of-the-envelope design of the accumulator looks like this:

type Accumulator struct {
    queue     chan *accumulateTask
    sequencer *sequencer
}

func (a *Accumulator) Send(*accumulateTask) error {
}

func (a *Accumulator) Close() error {
}

func (a *Accumulator) loop() error {
    ticker := time.NewTicker(MinAccumulateInterval)
    defer ticker.Stop()
    buffer := newAccumulateBuffer(MaxAccumulateSize)
    position := 0
    for {
        select {
        case at := <-a.queue:
            buffer.insert(position, at)
            position++
            if position < MaxAccumulateSize {
                continue
            }
        case <-ticker.C:
            if position == 0 {
                continue
            }
        }
        a.sequencer.Send(buffer)
        ticker.Reset(MinAccumulateInterval)
        buffer = newAccumulateBuffer(MaxAccumulateSize)
        position = 0
    }
}

It has two tunable parameters:

  • MinAccumulateInterval: The minimum interval for sending accumulated log entries to the sequencer. The accumulator keeps log entries from clients until this interval expires, unless the number of retained log entries reaches MaxAccumulateSize first.
  • MaxAccumulateSize: The maximum number of log entries kept by the accumulator. When the number of retained log entries reaches MaxAccumulateSize, the accumulator immediately sends them to the sequencer and resets the timer for MinAccumulateInterval.

Buffers that retain log entries can be pooled. Since the maximum number of log entries to keep is fixed, pooling is straightforward.

Currently, the Append RPC handler builds the write batch stored in the storage layer. Since handlers run concurrently, write batches are also built concurrently.
With the accumulator, however, the accumulator must build the write batch instead of the Append RPC handler. If this causes head-of-line blocking, we can apply an optimization such as double buffering.

Challenges

  • Performance testing
  • Finding good parameters
@ijsong ijsong self-assigned this Apr 13, 2023