Zeekr app comprehensively upgrades to a cloud native technology architecture

Alibaba Cloud helped the Zeekr app comprehensively upgrade to a cloud native technical architecture, effectively improving the user travel experience and helping the Zeekr 001 shooting brake coupe win the high-end pure electric market.

Customer introduction

Zeekr is a new high-end pure electric vehicle brand under Geely Holding Group. In April 2021, Zeekr released its first high-end intelligent electric model, the Zeekr 001, which was very well received by the market. By December 2022, cumulative deliveries of the 001 had exceeded 70,000, and for three consecutive months it had been the sales champion among luxury pure electric models from Chinese independent brands priced above 300,000 yuan.

Zeekr is committed to a service experience that goes beyond the car itself. In addition to providing excellent products, it connects with users through its app, which offers community interaction, charging and energy replenishment, vehicle control, subscription travel, certification services, the Jiwu boutique mall, vehicle ordering, an after-sales service mall, and more, achieving full life-cycle management of the product and full-scenario coverage of the user journey. Every step is integrated into the app: learning about models, deciding to buy, taking delivery and using the car, sharing experiences, charging, controlling the vehicle, and resolving after-sales problems. The Zeekr app gives owners intelligent vehicle control so they can check the car's status anytime and anywhere, and it also lets them shop for quality goods and join community activities, making vehicle information easier to get and travel more convenient and enjoyable.

Business challenges

Since the launch of the Zeekr app, the number of registered users has grown explosively, and user identities and their corresponding usage scenarios have kept expanding. Along the way, the app user experience has become more and more important. While the user base grows rapidly, the app must handle the internet's "three highs" (high concurrency, high performance, and high availability) and remain stable, smooth, and agile enough to respond quickly to new requirements. This posed real challenges for the Zeekr app R&D team.

  • High requirements for business continuity

Core app services such as remote vehicle control, online maps, and the 3C mall have strict business continuity requirements and must be online 24/7. During peak-season sales campaigns, new model launches, sudden trending events, and similar situations, the app faces high-concurrency, high-traffic pressure. This led to functional failures, pages that would not open, and excessive interface latency. In extreme cases the app was completely inaccessible, which seriously affected the user experience.

  • High release pressure and frequency

As user scenarios multiply, more and more features are waiting to be released, and the required iteration frequency keeps rising. However, because the app's server side lacked full-link grayscale release capability, the customer could only release during the early-morning low-traffic window in order to keep the business stable. This took a heavy toll on development, operations, and testing staff, and the ability to perform lossless releases at any time was urgently needed.

  • Technical architecture urgently needs iteration

When the company was first established, the priority was to meet business needs and launch the app quickly, so the overall design of the technical architecture had some shortcomings. These showed up as tight coupling between some services, long system call chains, inconsistent technical implementation standards, and unsuitable cloud product choices. For example, an investigation found that the request chain of one core interface was too long, causing frequent latency jitter that affected the user experience.

"Thank you for your great support for the Zeekr app cloud native architecture upgrade project. Over more than half a year of close cooperation, the TAM team used their professional technical capabilities and mature solutions to help us adopt cloud native architecture and product best practices, improved the stability and agility of our consumer-facing business systems, and provided solid technical support for reaching higher sales targets. This project is only the beginning of the cooperation between the two parties. Next, we hope to further improve business availability through projects such as multi-region active-active deployment, and to bring a better and more enjoyable travel experience to our vehicle users." -- Zeekr Development Director

Alibaba Cloud solutions
Upgrade cloud native architecture

14 special governance projects carried out

Alibaba Cloud's service team studied Zeekr's business in depth, carried out a comprehensive survey and assessment of the current state of the customer's app based on the eight SRE systems, and identified problems and risks in the technical architecture. It then delivered an overall plan for transforming the Zeekr app to a cloud native architecture, focused on 14 special projects including full-link grayscale release, microservice governance, capacity planning, elastic scaling, and cloud product optimization.

Full link grayscale

As Zeekr car sales have taken off, registered users and daily active users have grown rapidly, and more and more business scenarios and new features need to be supported. On average, a minor version ships every two or three days and a major version every half month. Under the original architecture, each version could only be released during the early-morning low-traffic window to avoid affecting the daytime business peak. The Alibaba Cloud service team helped the customer design and roll out a canary release scheme: deploy a grayscale version, verify it with a configured share of traffic, and after verification release to production and switch traffic over. This makes it possible to release minor Zeekr app versions at any time during the day. For cases where several microservices on a core business link must be released together, full-link grayscale across multiple services is achieved with the MSE cloud native gateway and grayscale traffic tagging, covering grayscale scenarios such as CDN, gateway, MQ, configuration, and database. Multi-service daytime release is achieved without changing any business code. Releases are verified by increasing traffic gradually, and if a problem appears, traffic can be switched back promptly, which reduces the stability risk of daytime releases. By reworking the Yunxiao (Alibaba Cloud DevOps) pipelines, Alibaba Cloud also helped Zeekr automate releases of its core business and improve deployment efficiency.
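
The idea of routing by traffic ratio and then carrying a grayscale tag along the whole call chain can be sketched roughly as follows. This is a minimal illustration, not how the MSE cloud native gateway is implemented: the header name `x-gray-tag`, the lane names, and all function names are hypothetical, and hashing the user ID into 100 buckets is just one common way to get a stable traffic split.

```python
import zlib

# Hypothetical header used to carry the grayscale tag between services.
GRAY_HEADER = "x-gray-tag"


def bucket_of(user_id: str) -> int:
    """Map a user ID to a stable bucket in [0, 100) so a user stays in one lane."""
    return zlib.crc32(user_id.encode("utf-8")) % 100


def choose_lane(headers: dict, user_id: str, gray_percent: int) -> str:
    """Pick the 'gray' or 'base' lane for a request.

    A tag set by an upstream hop always wins, which keeps the whole
    call chain in the same lane. Otherwise the entry point assigns a
    lane based on the configured traffic percentage.
    """
    tag = headers.get(GRAY_HEADER)
    if tag in ("gray", "base"):
        return tag
    return "gray" if bucket_of(user_id) < gray_percent else "base"


def outgoing_headers(headers: dict, lane: str) -> dict:
    """Copy the headers and attach the lane tag for downstream calls."""
    out = dict(headers)
    out[GRAY_HEADER] = lane
    return out
```

Raising `gray_percent` step by step corresponds to the gradual traffic increase described above, and setting it back to 0 sends all untagged traffic to the base version again.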

Lossless online and offline

The Zeekr app uses a microservice architecture. During version releases, elastic scale-out, and similar operations, it ran into problems such as a higher request failure rate and pods restarting repeatedly. To address this, the team used the capabilities of the microservice engine MSE to bring microservices online and offline without loss: adaptive waiting and active notification while an application goes offline, and readiness checks and service warm-up while it comes online. This effectively avoids losing traffic during a release and reduces the risk of failed business requests. MSE traffic protection was also introduced, with measures applied to core business scenarios such as interface rate limiting and degradation, MQ peak shaving, and rate limiting of slow SQL in the database, to improve overall service stability.
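
The two halves of this mechanism, warm-up when coming online and draining when going offline, can be illustrated with a small sketch. It is an assumption-laden simplification and not MSE's actual logic: the linear weight ramp, the `Instance` class, and every field and method name here are hypothetical.

```python
from dataclasses import dataclass


def warmup_weight(uptime_s: float, warmup_s: float, full_weight: int = 100) -> int:
    """Ramp the routing weight linearly from 1 up to full_weight during warm-up."""
    if warmup_s <= 0 or uptime_s >= warmup_s:
        return full_weight
    return max(1, int(full_weight * uptime_s / warmup_s))


@dataclass
class Instance:
    ready: bool = False      # becomes True once the readiness check passes
    draining: bool = False   # becomes True when shutdown starts
    in_flight: int = 0       # requests currently being processed

    def accepts_traffic(self) -> bool:
        """New requests are routed here only when ready and not draining."""
        return self.ready and not self.draining

    def begin_shutdown(self) -> None:
        """Stop advertising the instance; callers would be notified at this point."""
        self.draining = True

    def can_stop(self, waited_s: float, max_wait_s: float) -> bool:
        """Stop once in-flight requests finish or the wait budget is used up."""
        return self.draining and (self.in_flight == 0 or waited_s >= max_wait_s)
```

A newly started instance would receive only a small share of requests until `warmup_weight` reaches its full value, and an instance being taken offline would be stopped only after `can_stop` returns True.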

Horizontal splitting

As the business grew rapidly, insufficient capacity under the original Zeekr app architecture became increasingly prominent. During new car launches, sales campaigns, and sudden trending events, the system could not scale out quickly. In addition, many core business databases were placed on the same database instance, so a problem in one could bring down all of them. Alibaba Cloud recommended PolarDB-X to Zeekr and worked with Zeekr to split out core databases such as the points database and the user database one by one. Horizontal splitting of large tables solved the problem of oversized single tables and improved the database layer's ability to scale out elastically. To address the insufficient elasticity of the microservices, Alibaba Cloud also delivered container elasticity schemes such as multi-zone elastic node scaling, HPA, and CronHPA, improving the core services' ability to cope with traffic surges.
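
For the HPA part, the Kubernetes documentation describes the core scaling rule as desiredReplicas = ceil(currentReplicas × currentMetricValue / desiredMetricValue), with no change when the ratio is within a tolerance of 1.0 (0.1 by default). The sketch below applies that rule in isolation. The function name, parameter names, and default bounds are illustrative assumptions, and real HPA behavior such as stabilization windows and handling of not-ready pods is left out.

```python
import math


def desired_replicas(current_replicas: int,
                     current_metric: float,
                     target_metric: float,
                     min_replicas: int = 1,
                     max_replicas: int = 10,
                     tolerance: float = 0.1) -> int:
    """Simplified HPA replica calculation for a single metric."""
    if target_metric <= 0:
        raise ValueError("target_metric must be positive")
    ratio = current_metric / target_metric
    # Skip scaling when the metric is already close enough to the target.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Clamp the result to the configured replica bounds.
    return max(min_replicas, min(max_replicas, desired))
```

For example, with 4 replicas averaging 90% CPU against a 60% target, the rule asks for ceil(4 × 1.5) = 6 replicas.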

Upgrade cloud native products

The Zeekr app uses many Alibaba Cloud products, and because the business developed so quickly, a number of them were being used in suboptimal ways. After several rounds of discussion, Alibaba Cloud drew up a cloud native product optimization plan for Zeekr, including an ACK cluster version upgrade, replacement with the cloud native gateway, and an operating system upgrade. The microservice gateway, traffic gateway, Nginx ingress, API gateway, and others were migrated to the cloud native gateway, which cut unnecessary hops and the cost of maintaining several different products, improved the observability of core links, and reduced network latency. Based on Alibaba Cloud ARMS, observability standards were defined around the concerns of different roles (development, testing, and operations), covering the front end, the app, microservices, and containers. This provides end-to-end, full-stack performance monitoring and link tracing, and improves the ability to locate problems and failures.

Full-link stress testing and tuning

To determine the concurrency that the Zeekr app can actually sustain, Alibaba Cloud assisted Zeekr with multiple rounds of full-link stress testing and tuning of the core business interfaces. The stress tests not only verified the effect of special transformation schemes such as grayscale release, lossless online and offline, and elastic scaling, but also strengthened Zeekr's confidence in switching the production system to the cloud native architecture.

Business Value

A cloud native technology architecture that effectively improves the travel experience of car users

The Alibaba Cloud service team and the customer's R&D team worked together for half a year and successfully completed the cloud native architecture upgrade of the Zeekr app through 14 comprehensive, in-depth special governance projects, including full-link grayscale release, microservice governance, capacity planning, elastic scaling, and cloud product optimization. The project improved the stability, agility, and elasticity of the customer's core systems and effectively improved the user travel experience.