-
Hot commodity -
Product Dynamics
Hologres3.0 new upgrade: integrated real-time lake warehouse platform
Collection of technical practices and 40+industry cases
Enhanced disaster recovery capability, supporting 3AZ local disaster recovery deployment
Quick Start to Dynamic Table

Unified data platform Solve the problem of data islands and inconsistent data caliber One piece of data for multiple scenarios can replace OLAP engine (Greenplum/Presto/Impala/ClickHouse, etc.) or KV database (HBase/Redis, etc.) at the same time. Fast data processing Solve the problem of analysis efficiency TPC-H 30000GB standard test result is the first in the world, 23% ahead of the second. It supports real-time write and update with high throughput of 1 billion+/sec, and petabyte data can be analyzed in seconds. Full link real-time Solve the problem of data timeliness It supports high-performance data real-time writing, real-time updating, and real-time query, maintains data freshness, and helps enterprises analyze data in real time.
Sub second interactive analysis Scalable MPP architecture, computing and storage index optimization give full play to extreme performance, and realize sub second analysis of petabyte level data. High performance primary key query Hundreds of thousands of QPS high-performance spot checks per second, supporting high throughput updates, and improving the performance by more than 10 times compared with open source. Vector computing Integrate the Proxima vector engine, and combine with the machine learning platform PAI to seamlessly connect various large models. Federated query There is no need for data movement, and the query of MaxCompute offline data is accelerated, and OSS data is read and written on the surface. High throughput real-time write and update Native integration with Flink, Spark and other computing frameworks supports real-time writing and updating of high-throughput data. Full link real-time Data can be checked immediately after being written in real time. Binlog transmission capability of table update events is supported to reduce data processing delay. Load isolation The resource group supports resource competition scenarios such as isolation of different businesses, different query types, and write and read to ensure stability. High reliability design Multiple computing group instances form a high availability mode, share a share of storage, and support fast recovery of failed nodes. Enterprise level operation and maintenance Provide rich monitoring and alarm information, support system hot upgrade, and meet various enterprise level operation and maintenance requirements.
-
First month test -
Online production -
Resource package
zero one Get a free trial one Get a free trial of Hologres two Get a free trial of DataWorks zero two Real time synchronization of RDS data one New Hologres database two Create a new DataWorks real-time synchronization task zero three real-time analysis one Real time analysis of data two Build a large data visualization screen
-
Unified digital warehouse solution -
Offline real-time all-in-one solution -
Streaming digital warehouse solution -
Lake warehouse integrated solution -
Vector Retrieval Solution -
High availability solutions

Unified storage Unified data storage, unified indicator caliber, no data islands, simplified architecture, and guaranteed data consistency. Standard SQL Perfect SQL capability, support complex multi table, nested, window and other queries, reduce learning costs and shorten development cycle. Unified data service layer One data supports multiple scenarios, such as large-scale multidimensional analysis and high QPS online services, with millisecond response and interactive analysis.
Hologres This product -
Big data development and governance platform DataWorks -
Real time calculation Flink version

Metadata auto discovery MaxCompute and Hologres have realized two-way metadata automatic discovery and refresh as well as perfect data type support. Data sharing and interworking Storage direct reading is more than 10 times faster than accessing ordinary surfaces, and supports two-way synchronization of data in millions of rows/second, simplifying data publishing and back flushing scenarios. Unified service export Hologres directly accelerates the query of MaxCompute data without data movement, reducing data redundancy and realizing BI acceleration.
Hologres This product -
Cloud native big data computing service MaxCompute

one-stop The whole link can be expressed in SQL, and Hologres data at each layer can be reused and checked, facilitating the construction of data layering and multiplexing system of real-time data warehouse. High performance Flink's powerful real-time computing is perfectly combined with Hologres' extreme real-time writing and updating capabilities, multi-dimensional OLAP and high concurrency spot checking capabilities. Enterprise level operation and maintenance The operation and maintenance is simpler, the observability is better, and the security capability is stronger. It provides a variety of high availability capabilities to facilitate the construction of an enterprise class streaming warehouse.
Hologres This product -
Real time calculation Flink version

High performance Use vector engine to accelerate OSS/DLF/MaxCompute. Openness It is convenient to import and export data, and data warehouse data flows freely. cost performance The exclusive instance lake warehouse resource reuse does not require additional cost calculation, and the shared cluster serverless mode pays by use.

Extreme performance It supports efficient index construction and vector retrieval with high concurrency and low latency. Real time capability Support vector data can be written and updated in real-time with high performance, and data can be checked after being written. Easy to use The use of vector computation can be completed through standard syntax.
Hologres This product -
Machine learning PAI

On demand expansion and reduction Warehouse can be pulled up on time or on demand (Scale Out); Warehouse dynamic thermal expansion (Scale Up); Computing and storage are highly scalable and dual elastic. Cost reduction and efficiency increase Users can use resources as needed, and the cost can be minimized; Based on physical replication, physical files are fully reused, reducing cost and increasing efficiency. Computing group resource isolation Each computing group is naturally isolated from each other in physical resources to avoid interaction between computing groups and reduce service jitter.
Hologres This product
Billing method
Pay as you go It is settled every hour. One master and multiple slave instances can be used to ensure load isolation View details Storage and computing resource package Deduct the calculation and storage fees based on the volume, which is lower than the cost of pay as you go method. View details
data security Storage transmission encryption : The storage supports visible and controllable semi managed encryption (BYOK), and separate encryption rules can be set for each table. SSL can be enabled to encrypt network connections at the transport layer. Data desensitization : It supports desensitization by column level, and desensitization policies are set for specified users. At the same time, it supports multiple desensitization rules, such as IP address desensitization, email address desensitization, Hash desensitization, etc.
system safety Permission management : Alibaba Cloud general RAM authentication is supported, and AccessKey is created for identity authentication. Support multiple permission models such as simple permission, expert permission and schema level simple permission Operational audit : AliCloud operation audit ActionTrail console OpenAPI、 Developer tools, etc., query the instance operation event log in the past 90 days, and provide Query log information.
network security Access isolation : The classic network, VPC network, and public network of each instance are isolated, and only the corresponding Endpoint and virtual intranet IP (VIP) can be accessed. IP whitelist : On the basis of various access authentications, when the white list function is enabled, only devices in the white list are allowed to access Hologres instances, and devices not in the white list cannot be accessed through authentication.
-
Product architecture -
performance optimization -
Enterprise level operation and maintenance
-
Q: Lambda or Kappa architecture for real-time data warehouse? -
A: Lambda architecture stores fragmented status, resulting in inconsistent data and caliber, while Kappa architecture cannot meet the requirements of frequent data correction and update. Hologres proposed HSAP architecture to achieve the integration of offline real-time data analysis services. View details
-
Q: How does Streaming Warehouse choose to ensure real-time performance? -
A: Hologres combined with Flink can directly replace Flink+Kafka, realizing real-time write and update of 1 billion+/second data with high throughput, and solving the problem of real-time data warehouse layering. View details
-
Q: How to achieve performance tuning through Hologres? -
A: Hologres can optimize processes such as data table construction and data query. Ali Mama has reduced the time spent on data analysis of 600 million people by 72% through practice. View details
-
Q: How does Hologres optimize the query performance of semi-structured data? -
A: Hologres upgraded JSONB columnar storage, improved query performance by 400%+, reduced storage by 45%, and saved thousands of cores (estimated cost savings of millions of yuan) in Taobao's dual 11 search scenario. View details
-
Q: How to realize self diagnosis and self operation and maintenance through Hologres? -
A: Hologres can reveal worker level monitoring indicators to help businesses more accurately locate problems and check resource usage, so as to improve the overall availability of the system. View details
-
Q: How to troubleshoot Hologres OOM problems? -
A: The OOM problem usually occurs in the query, data import/export and other scenarios. The main reason is that the memory consumption is too high. Hologres has a variety of ways to gradually solve the problem of high memory water level. View details
Unified OLAP analysis engine, Noah's best practice of financial digitalization Replace Impala/Greenplug/Elasticsearch and other components, unify the OLAP analysis platform, and combine the offline database capabilities of DataWorks and MaxCompute to achieve a simpler architecture, faster queries, and lower costs. 2022-10-19 Replacing Kudu, TAL's best practice of real-time digital warehouse TAL originally used Kudu as the underlying OLAP engine and used Impala for data loading and calculation. When the business volume was increased, Kudu's technical bottleneck began to emerge. Through Hologres, it realized the ability of million level writing and millisecond level query, reducing the cost by nearly one million per year. 2022-08-10 Replace Hive+Presto, the best practice of Leyuan real-time digital warehouse Through testing, Leyuan found that the performance of Hologres has improved by 5~10 times compared with Presto. 64 core Hologres can directly replace 96 core Presto clusters, so it upgraded the digital warehouse architecture and increased the business operation efficiency by 10 times+. 2022-11-14