Introduce Follower Replication #33

Fullstop000 · 2019-11-13T15:14:59Z

Relate to tikv/raft-rs#249
Signed-off-by: Fullstop000 fullstop1005@gmail.com

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

siddontang · 2019-11-15T02:45:17Z

text/2019-11-13-follower-replication.md

+# Follower Replication
+
+## Summary
+This RFC introduces a new mechanism in Raft Protocol which allows a follower to send raft logs to other followers and learners.  The target of this feature is to reduce network transmission costs between different data centers in Log Replication.


I think this can also reduce the pressure of the leader when it has many followers or learners.

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

siddontang · 2019-11-15T03:26:41Z

can you paste your previous benchmark results here, so others can see the benefit intuitively.

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

Fullstop000 · 2019-11-15T14:33:50Z

@siddontang I'll make a more concrete benchmark for this. Some previous results are hard to explain why. But I'm ok to post them here :).

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

siddontang · 2019-11-18T06:36:11Z

text/2019-11-13-follower-replication.md

+    2. The progress state should be `Replicate` but not `paused`
+    3. The progress has the smallest `match_index`
+
+3. If no delegate is picked, the leader does Log Replication itself. Especially, if a group contains the leader it self, no delegate will be set by default except in some cases such as massively large group, which is able to be controlled by upper layer.


how do we do by the upper layer for the group which contains the leader?

It's unnecessary for the upper layer to acknowledge which group contains the leader because only the leader can choose delegates. And the leader itself must know which group it belongs to.

The description here might be somewhat confused. which is able to be controlled by upper layer means that the upper layer can decide whether the leader can choose a delegate in the group itself belongs to or not. I'll make it more clear.

text/2019-11-13-follower-replication.md

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

text/2019-11-13-follower-replication.md

* update for the implementation is changed Signed-off-by: qupeng <qupeng@pingcap.com> * address comments Signed-off-by: qupeng <qupeng@pingcap.com>

abbccdda

Thanks for the detailed design. Could we have a compatibility section, to discuss things such as some nodes in a group are still in old version, and could not recognize a delegate's request for entry replication?

Fullstop000 · 2020-07-15T04:03:01Z

@abbccdda A node in a group will send its group_id in msg and the receiver will update the sender's group info based on it. The leader only picks a delegate when the group info is enough. In a rolling-upgrade/downgrade situation, This can introduce several cases:

Upgrade

If only the leader uses follower replication, it can only know the group info until followers finish upgrading and send their group_id so that the leader uses origin log replication at this point
If only a follower or part of them use follower replication, the leader will just ignore the group_id in the msg so no delegate will be picked and origin log replication keeps processing

Downgrade

If only the leader use origin log replication, the case is just like common raft cluster and nothing special happens (pick a delegate, broadcast appends)
If a follower is downgraded and stop informing the leader its group_id, the leader will remove it from the group system and send entries directly to it

By such a design, the compatibility can be guaranteed when nodes in the cluster use either origin log replication or follower replication

It seems this feature description is missing in the RFC. I'll add it soon.

abbccdda · 2020-07-20T03:36:34Z

@Fullstop000 Thanks for the reply. It would be good to add such details to RFC for sure :)

abbccdda · 2020-07-20T03:40:00Z

text/2019-11-13-follower-replication.md

+
+There are four key concepts of design:
+
+- Every peer in a Raft group is associated with a `group_id`.


I was actually thinking whether we could make the delegation group support native in the first version. Suppose we want to form another delegation group in runtime, we need to change the static configs and do the cluster rolling bounce.

While it makes sense for the first version to focus on static configs, we may also propose runtime formulation of delegation group through third-party control such as PD. When the leader load goes up, it makes sense to distribute its load to other up-to-date followers. We could briefly talk about how we could keep the door open for such a future improvement, with the first version only supporting static group_id.

Good idea! Actually it's the first design of how the leader manages all the group info.

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

add RFC for Follower Replication

362865a

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

Fullstop000 mentioned this pull request Nov 14, 2019

Proposal: Introduce Follower Replication in Raft etcd-io/etcd#11357

Closed

siddontang reviewed Nov 15, 2019

View reviewed changes

Fullstop000 added 2 commits November 15, 2019 11:08

format

cb4970f

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

revise Motivation

58c83f7

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

make things more clear

1e46be1

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

revise detailed design

1027e21

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

Hoverbear assigned Fullstop000 Nov 15, 2019

Hoverbear added the Initial Comment Period This RFC is in the initial comment period, and has quite some time to give input on. label Nov 15, 2019

siddontang reviewed Nov 18, 2019

View reviewed changes

text/2019-11-13-follower-replication.md Outdated Show resolved Hide resolved

address comments

9c24706

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

hicqu mentioned this pull request Nov 25, 2019

Incubating Program: Follower Read With Applied Index pingcap/community#86

Closed

9 tasks

Merge branch 'master' into follower-replication

0f5bf2d

hicqu reviewed Jan 16, 2020

View reviewed changes

text/2019-11-13-follower-replication.md Outdated Show resolved Hide resolved

hicqu reviewed Jan 16, 2020

View reviewed changes

text/2019-11-13-follower-replication.md Outdated Show resolved Hide resolved

update for the implementation is changed (#1)

1854281

* update for the implementation is changed Signed-off-by: qupeng <qupeng@pingcap.com> * address comments Signed-off-by: qupeng <qupeng@pingcap.com>

Hoverbear added Final Comment Period This RFC is in the final comment period, and has a limited amount of time to give input on. and removed Initial Comment Period This RFC is in the initial comment period, and has quite some time to give input on. labels Jan 20, 2020

abbccdda reviewed Jul 14, 2020

View reviewed changes

abbccdda reviewed Jul 20, 2020

View reviewed changes

add compatibility section

dc858e5

Signed-off-by: Fullstop000 <fullstop1005@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce Follower Replication #33

Introduce Follower Replication #33

Fullstop000 commented Nov 13, 2019 •

edited

Loading

siddontang Nov 15, 2019

siddontang commented Nov 15, 2019

Fullstop000 commented Nov 15, 2019

siddontang Nov 18, 2019

Fullstop000 Nov 18, 2019 •

edited

Loading

abbccdda left a comment

Fullstop000 commented Jul 15, 2020

abbccdda commented Jul 20, 2020

abbccdda Jul 20, 2020

Fullstop000 Jul 20, 2020


		There are four key concepts of design:

		- Every peer in a Raft group is associated with a `group_id`.

Introduce Follower Replication #33

Are you sure you want to change the base?

Introduce Follower Replication #33

Conversation

Fullstop000 commented Nov 13, 2019 • edited Loading

siddontang Nov 15, 2019

Choose a reason for hiding this comment

siddontang commented Nov 15, 2019

Fullstop000 commented Nov 15, 2019

siddontang Nov 18, 2019

Choose a reason for hiding this comment

Fullstop000 Nov 18, 2019 • edited Loading

Choose a reason for hiding this comment

abbccdda left a comment

Choose a reason for hiding this comment

Fullstop000 commented Jul 15, 2020

Upgrade

Downgrade

abbccdda commented Jul 20, 2020

abbccdda Jul 20, 2020

Choose a reason for hiding this comment

Fullstop000 Jul 20, 2020

Choose a reason for hiding this comment

Fullstop000 commented Nov 13, 2019 •

edited

Loading

Fullstop000 Nov 18, 2019 •

edited

Loading