Vector database_model knowledge base_vector data storage

Forrester authoritative recognition and recommendation!
Forrester released the vector database market report, and Tencent Cloud Vector Database was recognized and recommended by authorities
Go now
The "first" has passed the evaluation of the authority
In the first batch of "trusted database" vector database product testing of the Chinese Academy of Information and Communications, Tencent Cloud Computing (Beijing) Co., Ltd. (Tencent Cloud for short) successfully completed the first vector database product basic capability evaluation.
Go now
"First" Industry Standard Released
The first vector database standard in China jointly prepared by China Academy of Information Technology, Tencent Cloud Computing (Beijing) Co., Ltd., China Mobile (Suzhou) Software Technology Co., Ltd. and other enterprises was officially released
Go now

Product characteristics

Embedding function

Data writing/retrieval is automatically vectorized to align with the use experience of traditional databases. Users do not need to pay attention to the vector generation process, which greatly reduces the threshold of use.

End to end AI kit

The end-to-end solution for content retrieval in the RAG field integrates the functions of document pre-processing, automatic vectorization, retrieval and fine sorting in the vector database, simplifies the data processing and retrieval process in RAG, and improves the efficiency of data access.

High performance

The vector database Tencent Cloud VectorDB single index supports a vector data scale of 100 billion levels, and can support millions of QPS and millisecond level query latency.

High availability

The vector database Tencent Cloud VectorDB provides multi copy high availability features, improves disaster tolerance, and ensures that the database can still operate normally in the face of node failures, load changes and other challenges.

low cost

Simply follow the instructions of the console to quickly create vector database instances. The whole process platform is managed without any installation, deployment, operation and maintenance operations, reducing machine costs, operation and maintenance costs, and labor costs.

Stable and reliable

The vector database Tencent Cloud VectorDB is derived from the vector retrieval engine OLAMA developed by Tencent Group itself. Nearly 40 business lines are running stably, and the daily average number of search requests is up to 100 billion. The service continuity and stability are guaranteed.

Product specification

Free Admission

Free beta

Guangzhou

CPU
1 core
Memory
1G
disk
20GB
Number of nodes
1

Trial scenario: only for quick test

High cost performance

Stand alone version (computational)

Guangzhou

CPU
2 cores
Memory
4G
disk
20G
Number of nodes
1

Recommended for individuals and small enterprises
Computational for QPS high and delay sensitive scenarios
Vector scale: 50w (1536 dimensions)

High cost performance

Stand alone version (storage type)

Guangzhou

CPU
1 core
Memory
8G
disk
20GB
Number of nodes
2

Recommended for individuals and small enterprises
Storage type is used in scenarios with large data volume and low QPS
Vector scale: 100w (1536 dimensions)

High cost performance

High availability version (storage type)

Guangzhou

CPU
4-core
Memory
32G
disk
100GB
Number of nodes
3

Recommended for medium and large enterprises
Support configuration of 3 data replicas
Vector scale: 400w (1536 dimensions)

End to end AI kit

AI suite is a one-stop document retrieval solution provided by Tencent Cloud VectorDB, which includes automated document parsing, information supplement, vectorization, content retrieval and other capabilities, and has a wealth of configurable items, which can significantly improve the recall effect of document retrieval. Users can quickly build a high-quality proprietary knowledge base within a few minutes by uploading the original documents, greatly improving the efficiency of knowledge access.

Embedding function

Better experience

Automatic vectorization of data writing/retrieval, aligned with the use experience of traditional databases, without paying attention to the vector generation process

Performance improvement

Optimize GPU processing speed, improve performance by 5~10 times, and improve efficiency

Shared resource pool

Massive GPU resource pool provides computing services, which can be used on demand with better cost

Product architecture

Tencent Cloud VectorDB is based on the vector engine OLAMA of Tencent Group, which processes hundreds of billions of searches every day. The bottom layer uses Raft distributed storage, and cluster management and scheduling are carried out through the master node to achieve efficient operation of the system. At the same time, Tencent Cloud Vector Database supports the setting of multiple shards and replicas, which further improves the load balancing capability, enabling the vector database to achieve high performance, high scalability and high disaster tolerance while processing massive vector data.

Application scenarios

Large model knowledge base
Recommendation system
Text/image retrieval

Tencent cloud vector database can be used together with the big language model LLM. After text segmentation and vectorization, the private domain data of enterprises can be stored in Tencent Cloud Vector Database to build an enterprise specific external knowledge base, so as to provide prompt information for the big model in subsequent retrieval tasks and assist the big model to generate more accurate answers.

Go now

Embedding function

End to end AI kit

High performance

High availability

low cost

Stable and reliable

Free beta

CPU

Memory

disk

Number of nodes

Stand alone version (computational)

CPU

Memory

disk

Number of nodes

Stand alone version (storage type)

CPU

Memory

disk

Number of nodes

High availability version (storage type)

CPU

Memory

disk

Number of nodes

Better experience

Performance improvement

Shared resource pool

Product Introduction

Product specification

quick get start

Operation Guide

Database design

API documentation