What type of study is this?

This is a Experimental Study study.

September 17, 2025

GaussDB-Vector: A Large-Scale Persistent Real-Time Vector Database for LLM Applications

Key Points

GaussDB-Vector significantly reduces inference costs for large language models, improving access and efficiency.
The system delivers real-time search capabilities and can handle concurrent inserts and deletes, ensuring data integrity.
Innovative design optimized for I/O operations supports large-scale distributed search and enhances performance.
Experimental results reveal GaussDB-Vector outperforms existing vector databases by 1 to 5 times.

Abstract

Vector databases are widely used as a fundamental tool for addressing the weaknesses of large language model (LLM) applications, specifically hallucinations and the high cost of inference. However, existing vector databases either cater to niche applications with low-latency in-memory search, or offer sophisticated data management capabilities but at the cost of low performance. To address these limitations, we propose GaussDB-Vector, a high-performance, real-time persistent vector database that excels in low-latency scalable search, real-time inserts and deletes, high availability, large-scale distributed search, and hybrid scalar-vector filtered search capabilities. These features are primarily achieved through an innovative storage architecture designed for a graph-based vector index, optimized for I/O operations and adaptable across various dataset sizes and dimensions, complemented by novel buffering strategies to further reduce I/O burdens. GaussDB-Vector supports product quantization, parallel search, and hardware acceleration via SIMD, GPUs, and NPUs in order to further accelerate queries. Experimental results show that GaussDB-Vector outperforms competitive baselines by a factor of 1 to 5 times.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Sun et al. (Fri,) studied this question.

synapsesocial.com/papers/68d46ccf31b076d99fa69119 — DOI: https://doi.org/10.14778/3750601.3750619

Authors

Ji Sun

Tsinghua University

Guoliang Li

Shandong University of Traditional Chinese Medicine

James Pan

University of Wisconsin–Madison

Journals

Proceedings of the VLDB Endowment

Actions

Institutions

Tsinghua University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

GaussDB-Vector: A Large-Scale Persistent Real-Time Vector Database for LLM Applications

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion