Information Center

Alibaba Cloud Hong Kong has returned to normal, which is caused by the cooling failure of the leased machine room

  

On December 19, Alibaba Cloud announced that the cooling failure of the machine room that caused the abnormality in Hong Kong zone C had been handled, and the cloud product functions in the zone had returned to normal.

 WeChat Picture_20221220152310

Review the whole process of this fault:

Morning of December 18

Alibaba Cloud publishes the notice of "Alibaba Cloud Hong Kong Zone C A Machine Room Device Exception".

"Alibaba Cloud Monitoring has found that the equipment in a computer room in Hong Kong is abnormal, which affects the use of cloud products such as ECS and cloud database PolarDB in zone C in Hong Kong. Alibaba Cloud engineers are already in the process of emergency handling."

Afternoon of December 18

The official microblog of the Macao Judicial Police Bureau confirmed that the websites and platforms of many units in Macao could not be used due to Alibaba Cloud failure.

The Network Security Accident Early Warning and Emergency Response Center received a message earlier today that the failure of Alibaba Cloud's Hong Kong machine room node led to the failure of the websites of key infrastructure operators such as the Macau Monetary Authority, Galaxy Macau, Lotus Satellite TV, and Macau Cement Factory, as well as takeout platforms such as Omi and MFood, and local media applications such as Macau Daily. Since today (18th) Since noon, access is temporarily unavailable. The Network Security Center has contacted relevant key infrastructure operators and followed up relevant issues. "

Afternoon of December 18

The troubleshooting shows that Alibaba Cloud Hong Kong's failure was confirmed to be caused by the failure of the refrigeration equipment in the PCCW machine room in Hong Kong. The failure affects the use of cloud products such as ECS, cloud database, storage products (object storage, table storage, etc.), cloud network products (global acceleration, NAT gateway, VPN gateway, etc.) in Zone C in Hong Kong. This failure also affected console access and API call operations in Hong Kong. At present, Alibaba Cloud engineers are cooperating with PCCW room engineers to speed up processing, and some refrigeration equipment is being recovered.

December 19th

The Alibaba Cloud official website announced that the fault has been repaired and the cloud product functions are being restored in succession.

At present, the refrigeration equipment fault of the machine room rented by Alibaba Cloud from Hong Kong PCCW has been repaired, and the cloud product functions of all zones in Alibaba Cloud Hong Kong are returning to normal. For the products affected by this failure, Alibaba Cloud will make compensation according to the SLA agreement of the related products.

Another refrigeration failure, another rented machine room

stay Data Center Among the business interruption factors, refrigeration failure ranked third. The refrigeration failure and low refrigeration efficiency caused by the compressor, safety valve or water failure will cause the temperature of the machine room to rise, affecting the performance of the equipment. If it cannot be handled in time, the temperature of the machine room will continue to rise, or the machine room will be shut down due to overheating, service interruption, hardware damage, and data loss.

For cloud manufacturers, it is not uncommon for the failure of the cooling equipment in the computer room to lead to the downtime of cloud products.

In 2020, Microsoft Azure's data center in the eastern United States will experience service interruption for six hours. Microsoft later disclosed that a cooling system failure was the cause of the outage. The faulty building automation control led to the reduction of air flow, and then the peak temperature of the entire data center hindered the performance of network devices, making computing and storage instances inaccessible.