Raft algorithm analysis - ksfzhaohui's personal page - OSCHINA - Chinese open source technology exchange community

Analysis of Raft algorithm

Preface
In the previous article, "ZAB protocol and Paxos algorithm", I mentioned that ZAB, the consistency protocol used in ZooKeeper, is essentially a simplification and optimization of Paxos. Paxos is complex (mainly because there is no primary/secondary relationship among its concurrent processes) and can even run into livelock, which makes concrete implementations harder. The Raft consensus algorithm introduced below was created against this background.
Raft is a consensus algorithm designed by Diego Ongaro and John Ousterhout of Stanford, with understandability as an explicit goal. In 2013 they published the Raft paper, "In Search of an Understandable Consensus Algorithm". There are now Raft implementations in more than ten languages, and the best-known project built on Raft is etcd. Google's Kubernetes also relies on etcd, using it to store cluster state and support service discovery.

About Raft
Raft's design has two main goals. The first is understandability: given equivalent functionality, understandability is the primary criterion. The second is definiteness for real systems: Raft tries to define every technical detail clearly, so that concrete implementations are unambiguous.
To achieve these two goals, Raft decomposes the consensus problem into three subproblems:
1. Leader election: elect a Leader, which is responsible for handling client requests
2. Log replication: the Leader replicates its log to the other servers and keeps them in sync
3. Safety: guarantee that all servers execute the same sequence of commands

Basic concepts
1. Role
Each server is in one of three states: Leader, Follower, or Candidate
Leader: at most one server in the cluster is the Leader in any given term; it is responsible for handling requests from all clients
Follower: every node starts in the Follower state; a Follower responds to log replication requests from the Leader and to vote requests from Candidates
Candidate: the state a Follower must move into before it starts a new Leader election; it is the intermediate state between Follower and Leader
The transitions between the three states are shown in the following figure (source: Internet):
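The same transitions can also be written down as a small lookup table. This is only a sketch: the event names (`election_timeout`, `won_majority`, and so on) are illustrative labels invented here, not identifiers from any Raft library.

```python
from enum import Enum


class Role(Enum):
    FOLLOWER = "follower"
    CANDIDATE = "candidate"
    LEADER = "leader"


# (current role, event) -> next role. Event names are hypothetical labels.
TRANSITIONS = {
    (Role.FOLLOWER, "election_timeout"): Role.CANDIDATE,    # no heartbeat seen
    (Role.CANDIDATE, "election_timeout"): Role.CANDIDATE,   # start a new election
    (Role.CANDIDATE, "won_majority"): Role.LEADER,
    (Role.CANDIDATE, "discovered_leader"): Role.FOLLOWER,
    (Role.CANDIDATE, "discovered_higher_term"): Role.FOLLOWER,
    (Role.LEADER, "discovered_higher_term"): Role.FOLLOWER,
}


def next_role(role: Role, event: str) -> Role:
    """Return the role after `event`; events that do not apply leave the role unchanged."""
    return TRANSITIONS.get((role, event), role)
```

Note that there is no direct Follower to Leader transition: a server always passes through the Candidate state first.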

2. Term
Raft uses the concept of a Term, which can be understood as a period. Raft divides the running time of the whole system into a sequence of Terms of arbitrary length and numbers them with increasing integers. Each term begins with an election, during which one or more servers in the Candidate state compete to become the new Leader. There are two possible outcomes:
1. A server wins the election and serves as the Leader for the rest of the term
2. No leader is elected; the term number is incremented and a new election starts in the new term
For a more intuitive view, see the figure below (source: Internet):

In other words, every time the term number increases, a new round of election takes place. Raft guarantees that there is at most one leader in a term. Next, let's look at the three subproblems one by one.

Raft protocol steps
1. Leader election
When the system starts, all servers are in the Follower state. If there is a leader, it periodically sends heartbeats to tell the other servers that it is the leader. If a follower receives no heartbeat for a period of time, it assumes that there is no leader and that a leader election is needed.
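The heartbeat check can be sketched as a timer that is reset on every heartbeat. This is a minimal sketch: the class name `ElectionTimer`, its methods, and the 0.15 to 0.30 second range are assumptions made for illustration. The timeout is randomized so that followers do not all time out at the same moment.

```python
import random
from typing import Optional


class ElectionTimer:
    """Tracks when a follower last heard from a leader (times in seconds)."""

    def __init__(self, now: float, min_timeout: float = 0.15,
                 max_timeout: float = 0.30,
                 rng: Optional[random.Random] = None):
        self._rng = rng or random.Random()
        self._min = min_timeout
        self._max = max_timeout
        self.deadline = 0.0
        self.reset(now)

    def reset(self, now: float) -> None:
        """Call on every heartbeat: pick a fresh randomized deadline."""
        self.deadline = now + self._rng.uniform(self._min, self._max)

    def expired(self, now: float) -> bool:
        """True when no heartbeat arrived in time, so an election should start."""
        return now >= self.deadline
```

A follower would call `reset(now)` whenever a heartbeat arrives and start an election once `expired(now)` returns True.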
To start an election, a Follower increments its term number, changes its state to Candidate, and sends RequestVote RPCs to the other servers in the cluster. It stays in the Candidate state until one of the following three events occurs:
1. It wins the election: the Candidate receives votes from a majority of the servers, becomes the Leader, and then sends heartbeats to the other servers to announce this.
2. Another server wins the election: while waiting, the Candidate receives an RPC from a server claiming to be the Leader. If the term number in this RPC is greater than or equal to the Candidate's own term number, the Candidate acknowledges the Leader and returns to the Follower state; otherwise it rejects the RPC and remains a Candidate.
3. A period of time passes and no leader emerges: in this case the term number is incremented and a new election starts. This can happen when several Followers become Candidates at the same time, so that the votes are split and nobody obtains a majority.
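The three outcomes can be sketched as event handlers on a node. This is a simplified sketch, not a full Raft implementation: the class `Node` and its method names are hypothetical, networking is left out, and it assumes that a candidate votes for itself when it starts an election.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    node_id: str
    cluster_size: int
    term: int
    role: str = "follower"
    votes: set = field(default_factory=set)

    def start_election(self) -> None:
        """Increment the term, become a candidate, and vote for itself."""
        self.term += 1
        self.role = "candidate"
        self.votes = {self.node_id}

    def on_vote_granted(self, voter_id: str) -> None:
        """Outcome 1: become leader once a majority of the cluster voted for us."""
        if self.role != "candidate":
            return
        self.votes.add(voter_id)
        if len(self.votes) > self.cluster_size // 2:
            self.role = "leader"

    def on_leader_message(self, leader_term: int) -> bool:
        """Outcome 2 (candidate only): accept a leader whose term is at
        least ours, reject a stale one and stay a candidate."""
        if self.role == "candidate" and leader_term >= self.term:
            self.term = leader_term
            self.role = "follower"
            return True
        return False

    def on_election_timeout(self) -> None:
        """Outcome 3: nobody won (for example a split vote), so try again."""
        if self.role == "candidate":
            self.start_election()
```

In a five-node cluster, a candidate needs three votes including its own, so two granted votes from other servers are enough to become leader.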

2. Log replication
Log replication keeps the nodes consistent with each other, and the operations in this phase also provide high availability. Once a leader has been elected, it is responsible for client requests, and every request must go through the leader first. These requests (commands) are stored as log entries. After receiving a client command, the leader appends it to the end of its log and then sends AppendEntries RPCs to the other servers in the cluster so that they copy the new entry. Once a majority of the servers have copied the entry, the leader applies the command to its state machine and returns the result to the client.
The log structure is shown in the following figure (source: Internet):
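The request path described above can be sketched with in-memory objects. This is a deliberately reduced sketch: `Follower.append_entries` is a stand-in for the AppendEntries RPC, the `("set", key, value)` command format and the dictionary state machine are assumptions, and retries and the log consistency check of real Raft are omitted.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    term: int
    command: tuple  # hypothetical command format: ("set", key, value)


@dataclass
class Follower:
    reachable: bool = True
    log: list = field(default_factory=list)

    def append_entries(self, entries: list) -> bool:
        """Stand-in for the AppendEntries RPC: copy the entries if reachable."""
        if not self.reachable:
            return False
        self.log.extend(entries)
        return True


@dataclass
class Leader:
    term: int
    followers: list
    log: list = field(default_factory=list)
    state: dict = field(default_factory=dict)  # toy key-value state machine

    def handle_client_command(self, command: tuple):
        """Append, replicate, and apply the command once a majority holds it."""
        entry = Entry(self.term, command)
        self.log.append(entry)
        acks = 1  # the leader's own copy counts
        for follower in self.followers:
            if follower.append_entries([entry]):
                acks += 1
        cluster_size = len(self.followers) + 1
        if acks > cluster_size // 2:
            _, key, value = command
            self.state[key] = value  # apply to the state machine
            return value
        return None  # stored on the leader, but not yet on a majority
```

With three servers, one unreachable follower does not block progress, because the leader and the remaining follower already form a majority.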

Each log entry contains two things: the command itself and the term number. There is also a global log index that gives the position of each entry in the log. Once a majority of the servers have stored an entry in their logs, the entry can be considered committable. For example, in the figure above, the entries up to log index 7 can be committed.
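The majority rule can be expressed as a small calculation over the last log index stored on each server. This is a sketch with a hypothetical function name and illustrative input values; it assumes that the servers' logs agree on their common prefix.

```python
from typing import List


def highest_majority_index(last_indexes: List[int]) -> int:
    """Given the last log index stored on each server (leader included),
    return the highest index that a majority of servers have stored."""
    ordered = sorted(last_indexes, reverse=True)
    majority = len(ordered) // 2 + 1
    # The majority-th largest value is stored by at least `majority` servers.
    return ordered[majority - 1]


# Five servers whose logs end at these indexes (illustrative values):
print(highest_majority_index([8, 5, 8, 2, 7]))
```

For the five illustrative logs above, three servers hold entries up to index 7, so index 7 is the highest index stored on a majority.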

3. Safety
Safety is the mechanism that ensures every node executes the same sequence of commands. For example, suppose a follower is unavailable while the current leader commits a command, and that follower is later elected leader. The new leader could then overwrite the previously committed entries with new ones, so that nodes would execute different sequences. Safety ensures that any elected leader holds all previously committed entries.
To achieve safety, Raft adds two constraints:
1. Only a server whose log contains all committed commands can be elected leader.
2. A new leader can treat entries from earlier terms as truly committed only after it has committed an entry from its own current term.
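One way to picture the two constraints is as two small predicates. This is a sketch under assumptions: the function names are hypothetical, and the comparison used for the first constraint (last entry's term first, then log length) is the up-to-date check from the Raft paper, which the text above does not spell out.

```python
def candidate_log_is_up_to_date(candidate_last_term: int,
                                candidate_last_index: int,
                                voter_last_term: int,
                                voter_last_index: int) -> bool:
    """Constraint 1: a voter grants its vote only if the candidate's log is
    at least as up to date as its own (later last term wins; on a tie,
    the longer log wins)."""
    if candidate_last_term != voter_last_term:
        return candidate_last_term > voter_last_term
    return candidate_last_index >= voter_last_index


def can_commit_by_counting(entry_term: int, current_term: int,
                           replica_count: int, cluster_size: int) -> bool:
    """Constraint 2: a leader commits by counting replicas only for entries
    from its own term; earlier entries become committed along with them."""
    return entry_term == current_term and replica_count > cluster_size // 2
```

Because a candidate needs votes from a majority, and every committed entry is stored on a majority, at least one voter holds each committed entry and will refuse a candidate whose log is missing it.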

Summary
Compared with Paxos, Raft is easier to understand and clearer to implement, which is why it has been so widely adopted in just a few years. ZAB is also essentially a simplification and optimization of Paxos, so Raft and ZAB have many similarities. I plan to compare the two in a future article.
