How to develop highly reliable services on k8s? A container cloud expert has something to say

2018/04/13 13:42

Kubernetes (k8s) is currently the mainstream container orchestration system. It mainly solves the problem of managing containerized applications in a cluster environment, covering the following aspects:

  • Container cluster management
    • Orchestration
    • Scheduling
    • Access
  • Infrastructure management
    • Computing resources
    • Network resources
    • Storage resources

The strength of k8s lies in its good design concepts and abstractions, which have attracted more and more developers to the k8s community; the number of companies running k8s as their infrastructure service is also growing steadily.

In terms of design, only the APIServer communicates with etcd (the storage) in k8s; other components keep state in memory and persist data through the APIServer. Management components are triggered in a level-based rather than edge-based way, taking action by comparing the "current state" and "desired state" of resources. K8s adopts a layered design built on abstract interfaces, with different plugins satisfying different needs.
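To make the level-based idea concrete, here is a minimal, illustrative reconcile loop in Go (not actual k8s code; the state and helper functions are hypothetical): the action depends only on the difference between the desired and current state, so a missed event is simply corrected on the next iteration.

```go
// Minimal sketch of a level-based reconcile loop (illustrative only;
// the State type and the desired/current/scaleTo helpers are hypothetical).
package main

import (
	"log"
	"time"
)

type State struct {
	Replicas int
}

// In a real controller these would be read from the APIServer; hard-coded here.
func desired() State { return State{Replicas: 3} }
func current() State { return State{Replicas: 2} }

func scaleTo(n int) { log.Printf("scaling to %d replicas", n) }

func main() {
	for range time.Tick(10 * time.Second) {
		d, c := desired(), current()
		// Level-based: act on the observed difference between desired and
		// current state, not on individual events, so a missed event is
		// corrected on the next iteration.
		if c.Replicas != d.Replicas {
			scaleTo(d.Replicas)
		}
	}
}
```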

In terms of abstraction, different workloads serve different kinds of applications, such as Deployment for stateless applications and StatefulSet for stateful applications. For access management, Service decouples service providers from consumers inside the cluster, and Ingress provides access from outside the cluster to services inside it.

Although k8s has good design concepts and abstractions, its steep learning curve and incomplete development documentation greatly increase the difficulty of developing applications on it.

Based on the author's development practice, this article takes MySQL on k8s as an example to describe how to develop highly reliable applications on k8s, abstracting the best practices involved as far as possible to reduce the cost of developing such applications.

MySQL on k8s

The design and development of applications cannot be separated from business requirements. The requirements for MySQL applications are as follows:

  1. High reliability of data
  2. High availability of services
  3. Easy to use
  4. Easy operation and maintenance

Meeting these requirements relies on the cooperation of k8s and the application itself; that is, developing highly reliable applications on k8s requires both k8s knowledge and domain knowledge of the application.

The following will analyze the corresponding solutions according to the above requirements.

1. High reliability of data

High data reliability generally depends on the following:

  • Redundancy
  • Backup/recovery

We use Percona XtraDB Cluster as the MySQL cluster solution. It is a multi-master MySQL architecture in which real-time data synchronization between instances is achieved with Galera replication. This cluster scheme avoids the data loss that a master-slave cluster may suffer during a master-slave switchover, further improving data reliability.

For backup, we use XtraBackup as the backup/recovery solution; it performs hot backups, so backing up does not affect users' normal access to the cluster.

In addition to "scheduled backup", we also provide "manual backup" to meet business needs for backup data.

2. High availability of services

Here, we will analyze it from the perspectives of "data link" and "control link".

The "data link" is a link for users to access MySQL services. We use the MySQL cluster scheme of three master nodes to provide access to users through TLB (the four layer load balancing service developed by Qiniu). TLB not only realizes the load balancing of the access layer to the MySQL instance, but also realizes the health detection of the service, automatically removes the abnormal node, and automatically joins the node when it recovers. As shown below:

Based on the above MySQL cluster scheme and TLB, the failure of one or two nodes will not affect users' normal access to the MySQL cluster, ensuring high availability of the MySQL service.

The "control link" is the management link of the MySQL cluster, which is divided into two levels: • Global control management • Control management of each MySQL cluster Global control management is mainly responsible for "creating/deleting clusters", "managing the status of all MySQL clusters", etc. It is implemented based on the concept of the operator. Each MySQL cluster has a controller that is responsible for "task scheduling", "health detection", "automatic fault handling", etc. of the cluster.

This split delegates the management work of each cluster to the cluster itself, reducing interference between the control links of different clusters and relieving pressure on the global controller. As shown below:

Here is a brief introduction to the concept and implementation of Operator.

Operator is a concept proposed by CoreOS for creating, configuring, and managing complex applications. It consists of two parts:

Resource
  • A custom resource
  • Provides users with a simple way to describe their expectations of the service

Controller
  • Creates the resource
  • Watches for changes to the resource, in order to realize the user's expectations of the service

The workflow is shown in the following figure:

That is:

  1. Register CR (CustomResource) resources
  2. Listen for changes in CR objects
  3. The user performs CREATE/UPDATE/DELETE operations on the CR resource
  4. Trigger the corresponding handler for processing

Based on practice, we have abstracted Operator development as follows.

The CR is abstracted into a structure like the following:
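The original structure definition is not reproduced here; the following Go sketch shows the kind of structure a CR usually maps to, with the concrete QiniuMySQL fields invented for illustration rather than taken from the original definition.

```go
// Illustrative sketch of a CR structure following k8s conventions
// (the QiniuMySQL spec/status fields here are assumptions).
package v1alpha1

import (
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

type QiniuMySQL struct {
	metav1.TypeMeta   `json:",inline"`
	metav1.ObjectMeta `json:"metadata,omitempty"`

	Spec   QiniuMySQLSpec   `json:"spec"`             // the user's expectation of the service
	Status QiniuMySQLStatus `json:"status,omitempty"` // the observed state maintained by the controller
}

type QiniuMySQLSpec struct {
	Replicas int    `json:"replicas"` // e.g. 3 master nodes
	Version  string `json:"version"`
}

type QiniuMySQLStatus struct {
	State string `json:"state"` // e.g. Green / Yellow / Red
}
```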

The handling of CR ADD/UPDATE/DELETE events is abstracted as the following interface:
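The original interface definition is not reproduced here; the sketch below shows what such an abstraction typically looks like, with method names that are assumptions rather than the original ones.

```go
// Illustrative sketch of an event-handling interface for CR ADD/UPDATE/DELETE
// events (method names are assumptions; the original interface is not shown).
package operator

// Handler is implemented by each concrete Operator; the framework invokes the
// corresponding method when the event is observed on the CR.
type Handler interface {
	OnAdd(obj interface{}) error
	OnUpdate(oldObj, newObj interface{}) error
	OnDelete(obj interface{}) error
}
```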

On top of these abstractions, Qiniu provides a simple Operator framework that transparently handles operations such as creating the CR and watching CR events, making it easier to develop an Operator.

We developed the MySQL Operator and the MySQL Data Operator, which are used respectively for creating/deleting clusters and for manual backup/recovery.

Since each MySQL cluster has several types of task logic, such as "data backup", "data recovery", "health detection" and "automatic fault handling", and running them concurrently may cause exceptions, a task scheduler is needed to coordinate their execution. The Controller plays this role:
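As a rough illustration of this kind of serialization (not the actual Controller code), the following sketch funnels tasks from different workers through a single goroutine, so no two tasks run at the same time.

```go
// Minimal sketch of per-cluster task serialization (illustrative only).
// Tasks such as backup, recovery and fault handling are submitted by different
// workers but executed one at a time by a single scheduler goroutine.
package main

import (
	"log"
	"sync"
)

type Task struct {
	Name string
	Run  func() error
}

func runScheduler(tasks <-chan Task, wg *sync.WaitGroup) {
	defer wg.Done()
	for t := range tasks {
		log.Printf("running task %q", t.Name)
		if err := t.Run(); err != nil {
			log.Printf("task %q failed: %v", t.Name, err)
		}
	}
}

func main() {
	tasks := make(chan Task, 16)
	var wg sync.WaitGroup
	wg.Add(1)
	go runScheduler(tasks, &wg)

	// Different workers submit their tasks; only one runs at a time.
	tasks <- Task{Name: "backup", Run: func() error { return nil }}
	tasks <- Task{Name: "health-check", Run: func() error { return nil }}
	close(tasks)
	wg.Wait()
}
```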

Through the Controller and the various workers, each MySQL cluster becomes self-operating and self-maintaining.

In terms of "health detection", we have implemented two mechanisms: • passive detection • active detection "passive detection" means that each MySQL instance reports the health status to the Controller, and "active detection" means that the Controller requests the health status of each MySQL instance. These two mechanisms complement each other to improve the reliability and timeliness of health detection.

The Controller and the Operator both use the health detection data, as shown in the following figure:

The Controller uses the health detection data to discover MySQL cluster exceptions in time and handle the corresponding failures, so it needs accurate and timely health status information. It maintains the status of all MySQL instances in memory, updates each instance's status according to the results of "active detection" and "passive detection", and acts accordingly.
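A minimal sketch of this in-memory bookkeeping, with hypothetical type and method names, might look like the following; both passive reports and active probe results funnel into the same update path.

```go
// Illustrative sketch of in-memory instance-status bookkeeping
// (type and method names are assumptions, not the original code).
package controller

import (
	"sync"
	"time"
)

type InstanceStatus struct {
	State     string // Green / Yellow / Red-clean / Red-unclean / Unknown
	UpdatedAt time.Time
}

type StatusStore struct {
	mu        sync.Mutex
	instances map[string]InstanceStatus
}

func NewStatusStore() *StatusStore {
	return &StatusStore{instances: make(map[string]InstanceStatus)}
}

// Update is called both when an instance reports its own health (passive
// detection) and when the Controller probes the instance (active detection).
func (s *StatusStore) Update(instance, state string) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.instances[instance] = InstanceStatus{State: state, UpdatedAt: time.Now()}
}

// Snapshot returns a copy used to compute the overall cluster status.
func (s *StatusStore) Snapshot() map[string]InstanceStatus {
	s.mu.Lock()
	defer s.mu.Unlock()
	out := make(map[string]InstanceStatus, len(s.instances))
	for k, v := range s.instances {
		out[k] = v
	}
	return out
}
```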

The Operator uses the health detection data to report the running condition of the MySQL cluster to the outside world, and to intervene in fault handling when the Controller itself is abnormal.

In practice, because health detection runs at a relatively high frequency, a large number of health states are generated. If every health state were persisted, the Operator and the APIServer would come under huge access pressure. Since only the most recent health state is meaningful, the Controller inserts the health statuses to be reported to the Operator into a queue of limited capacity; when the queue is full, the oldest statuses are discarded.
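A minimal sketch of such a capacity-limited queue follows (names are hypothetical; synchronization is omitted for brevity): when the queue is full, the oldest entry is dropped so that only the most recent data is kept for reporting.

```go
// Illustrative sketch of a capacity-limited health-status queue that drops the
// oldest entry when full (names are assumptions; not thread-safe as written).
package controller

type HealthStatus struct {
	Instance string
	State    string
}

type boundedQueue struct {
	items []HealthStatus
	limit int
}

func newBoundedQueue(limit int) *boundedQueue {
	return &boundedQueue{limit: limit}
}

func (q *boundedQueue) Push(s HealthStatus) {
	if len(q.items) == q.limit {
		q.items = q.items[1:] // full: discard the oldest status
	}
	q.items = append(q.items, s)
}

// PopAll drains the queue; the caller reports the drained statuses to the Operator.
func (q *boundedQueue) PopAll() []HealthStatus {
	out := q.items
	q.items = nil
	return out
}
```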

When the Controller detects an exception in the MySQL cluster, it will automatically handle the failure.

First, we define the fault handling principles:

  • No data loss
  • No impact on availability, as far as possible
  • Automatic handling of known, treatable faults
  • No automatic handling of unknown or untreatable faults; such faults are handled through manual intervention

In fault handling there are these key issues:

  • What types of failure exist
  • How to detect and sense failures in a timely manner
  • Whether a failure has occurred at the moment
  • What type of failure has occurred
  • How to handle it

To address these key issues, we define three levels of cluster status:

Green
  • External service is available
  • The number of running nodes meets expectations

Yellow
  • External service is available
  • The number of running nodes does not meet expectations

Red
  • External service is unavailable

At the same time, the following statuses are defined for each mysqld node:

Green
  • The node is running
  • The node is in the MySQL cluster

Yellow
  • The node is running
  • The node is not in the MySQL cluster

Red-clean
  • The node has exited gracefully

Red-unclean
  • The node has exited ungracefully

Unknown
  • The node status is unknown

After collecting the status of all MySQL nodes, the Controller calculates the status of the MySQL cluster from these node statuses. When the cluster status is detected to be anything other than Green, the "fault handling" logic is triggered and the fault is handled according to the known handling schemes; if the fault type is unknown, it is handled manually. The whole process is as follows:
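The original flow chart is not reproduced here. As a minimal illustration of just the status-calculation step in that flow, the following Go sketch derives a cluster status from node statuses; the exact rule used is an assumption based on the definitions above.

```go
// Illustrative sketch of deriving the cluster status from node statuses
// (the threshold rule is an assumption based on the definitions above).
package controller

type NodeState string

const (
	NodeGreen      NodeState = "Green"       // running and in the cluster
	NodeYellow     NodeState = "Yellow"      // running but not in the cluster
	NodeRedClean   NodeState = "Red-clean"   // exited gracefully
	NodeRedUnclean NodeState = "Red-unclean" // exited ungracefully
	NodeUnknown    NodeState = "Unknown"
)

type ClusterState string

const (
	ClusterGreen  ClusterState = "Green"  // available, expected node count running
	ClusterYellow ClusterState = "Yellow" // available, fewer nodes than expected
	ClusterRed    ClusterState = "Red"    // unavailable
)

// clusterStatus assumes the service stays available as long as at least one
// node is Green, matching the article's claim that one or two node failures
// do not affect access to the three-master cluster.
func clusterStatus(nodes []NodeState, expected int) ClusterState {
	running := 0
	for _, n := range nodes {
		if n == NodeGreen {
			running++
		}
	}
	switch {
	case running == 0:
		return ClusterRed
	case running < expected:
		return ClusterYellow
	default:
		return ClusterGreen
	}
}
```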

Due to the different fault scenarios and handling schemes for each application, the specific handling methods will not be described here.

3. Easy to use

Based on the Operator concept, we implemented a highly reliable MySQL service and defined two types of resources for users: QiniuMySQL and QiniuMySQLData. The former describes the user's configuration of a MySQL cluster; the latter describes a manual data backup/recovery task. Here we take QiniuMySQL as the example.

A user can trigger the creation of a MySQL cluster with a simple yaml file like the following:
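The original yaml example is not reproduced in this text, so the following is only an illustrative sketch of what such a manifest could look like; the apiVersion, kind, and field names are assumptions rather than the actual schema.

```yaml
# Illustrative sketch only; field names are assumptions, not the exact schema.
apiVersion: mysql.qiniu.com/v1alpha1
kind: QiniuMySQL
metadata:
  name: demo-mysql
spec:
  replicas: 3                    # three master nodes
  version: "5.7"
  backupSchedule: "0 2 * * *"    # scheduled backup, cron format
```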

After the cluster is created, the user can obtain the cluster status through the status field of the CR object:
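Again for illustration only (the actual status fields are not shown here), the status stanza could look roughly like this:

```yaml
# Illustrative sketch only; the actual status fields are assumptions.
status:
  state: Green                   # Green / Yellow / Red, as defined above
  readyNodes: 3
  lastBackupTime: "2018-04-13T02:00:00Z"
```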

Here is another concept: Helm.

Helm is a package management tool for k8s. It standardizes the delivery, deployment and use process of k8s applications by packaging them as Charts.

A Chart is essentially a collection of k8s yaml files and parameter files, so an application can be delivered as a single Chart. By operating on Charts, Helm can deploy and upgrade applications with one click.

Due to space constraints, and because Helm usage is general-purpose, the specific usage process is not described here.

4. Easy operation and maintenance

In addition to the "health detection" and "automatic fault handling" described above and the Helm-managed delivery and deployment of applications, the following issues need to be considered for operation and maintenance:

  • Monitoring/alerting
  • Log management

We use Prometheus + Grafana for monitoring/alerting. The service exposes metric data to Prometheus through an HTTP API, and the Prometheus server pulls it periodically. Developers visualize the monitoring data from Prometheus in Grafana, and set alert thresholds on the charts based on their understanding of the charts and of the application; Grafana then raises the alerts.
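As a concrete illustration of exposing metrics over HTTP for Prometheus to pull, here is a minimal Go sketch using the official client library; the metric name and port are made up for the example.

```go
// Minimal sketch of exposing metrics to Prometheus over HTTP using the
// official client library (metric name, labels and port are illustrative).
package main

import (
	"log"
	"net/http"

	"github.com/prometheus/client_golang/prometheus"
	"github.com/prometheus/client_golang/prometheus/promhttp"
)

var backupTotal = prometheus.NewCounterVec(
	prometheus.CounterOpts{
		Name: "mysql_backup_total", // hypothetical metric name
		Help: "Number of MySQL backups, by result.",
	},
	[]string{"result"},
)

func main() {
	prometheus.MustRegister(backupTotal)
	backupTotal.WithLabelValues("success").Inc()

	// The Prometheus server scrapes this endpoint periodically.
	http.Handle("/metrics", promhttp.Handler())
	log.Fatal(http.ListenAndServe(":9104", nil))
}
```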

This approach of visualizing first and alerting second has greatly improved our understanding of the application's runtime characteristics, helped us identify the metrics and alert thresholds that deserve attention, and reduced the number of spurious alerts.

In development, inter-service communication is implemented with gRPC. The gRPC ecosystem has an open source project called go-grpc-prometheus; by inserting a few simple lines of code into a service, you can monitor all RPC requests of the gRPC server.
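A minimal sketch of how go-grpc-prometheus is typically wired into a gRPC server follows; service registration is reduced to a comment, and the port numbers are made up for the example.

```go
// Sketch of wiring go-grpc-prometheus into a gRPC server: registering the
// interceptors yields metrics for every RPC (ports are illustrative).
package main

import (
	"log"
	"net"
	"net/http"

	grpc_prometheus "github.com/grpc-ecosystem/go-grpc-prometheus"
	"github.com/prometheus/client_golang/prometheus/promhttp"
	"google.golang.org/grpc"
)

func main() {
	server := grpc.NewServer(
		grpc.UnaryInterceptor(grpc_prometheus.UnaryServerInterceptor),
		grpc.StreamInterceptor(grpc_prometheus.StreamServerInterceptor),
	)

	// Register your gRPC services on `server` here, then initialize the metrics.
	grpc_prometheus.Register(server)

	lis, err := net.Listen("tcp", ":50051")
	if err != nil {
		log.Fatalf("listen: %v", err)
	}
	go server.Serve(lis)

	// Expose the collected RPC metrics alongside other application metrics.
	http.Handle("/metrics", promhttp.Handler())
	log.Fatal(http.ListenAndServe(":9090", nil))
}
```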

For containerized services, log management has two dimensions: log collection and log rotation.

We write service logs to syslog, and then forward the syslog output to the container's stdout/stderr so that logs can be collected externally in the conventional way. At the same time, logrotate is configured for syslog to rotate logs automatically, preventing service exceptions caused by logs filling up the container's disk space.

To improve development efficiency, we use https://github.com/phusion/baseimage-docker as the base image; it has built-in syslog and logrotate services, so the application only needs to log to syslog and does not need to care about log collection or rotation.
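For illustration, a Go service running on such an image only needs to write to syslog, for example with the standard library's log/syslog package (the tag name here is made up):

```go
// Minimal sketch of sending application logs to syslog from Go; the image's
// syslog/logrotate setup then handles forwarding to stdout/stderr and rotation.
package main

import (
	"log"
	"log/syslog"
)

func main() {
	w, err := syslog.New(syslog.LOG_INFO|syslog.LOG_LOCAL0, "mysql-controller")
	if err != nil {
		log.Fatalf("connect to syslog: %v", err)
	}
	log.SetOutput(w)
	log.Println("controller started") // ends up in syslog, rotated by logrotate
}
```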

Summary

Through the above description, the complete MySQL application architecture is as follows:

In the process of developing a highly reliable MySQL application on k8s, as our understanding of k8s and MySQL deepened, we kept abstracting and gradually implemented the following general logic and best practices as modules:

  • Operator development framework
  • Health detection service
  • Automatic fault handling service
  • Task scheduling service
  • Configuration management service
  • Monitoring service
  • Log service
  • etc.

With these general pieces of logic and best practices modularized, developers of new highly reliable k8s-based applications can quickly assemble the k8s-related interactions like "building blocks". Such applications are highly reliable from the start because they apply these best practices, and developers can shift their attention from the steep k8s learning curve to the application domain, improving service reliability from the application side as well.

Niuren Says

The "Niuren Says" column is devoted to discovering the thinking of technical people, covering technical practice, practical know-how, technical insights, growth stories, and any technical content worth discovering. We hope to bring together the best engineers and dig out voices that are unique, sharp, and of the moment.
