There is an increasing demand for extending existing DBMSs with vector indices to become unified systems that can support modern predictive applications, which require joint querying of vector embeddings and structured properties and connections of objects. We present NaviX, a Na tive v ector i nde X for graph DBMSs (GDBMSs) that has two main design goals. First, we aim to implement a disk-based vector index that leverages the core storage and query processing capabilities of the underlying GDBMS. To this end, NaviX is a hierarchical navigable small world (HNSW) index, which is itself a graph-based structure. Second, we aim to evaluate predicate-agnostic filtered vector search queries, where the k nearest neighbors (kNNs) of a query vector υ Q are searched across an arbitrary subset S of vectors that is specified by an ad-hoc selection sub-query Q S . We adopt a prefiltering-based approach that evaluates Q S first and passes the full information about S to the kNN search operator. We study how to design a prefiltering-based search algorithm that is robust under different selectivities as well as correlations of S with υ Q . We propose an adaptive algorithm that utilizes local selectivity of each vector in the HNSW graph to pick a suitable heuristic at each iteration of the kNN search algorithm. We demonstrate NaviX's robustness and efficiency through extensive experiments against both existing prefiltering- and postfiltering-based baselines.
Building similarity graph...
Analyzing shared references across papers
Loading...
Gaurav Sehgal
Semih Salihoğlu
Proceedings of the VLDB Endowment
University of Waterloo
Building similarity graph...
Analyzing shared references across papers
Loading...
Sehgal et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68c18f329b7b07f3a06155fa — DOI: https://doi.org/10.14778/3749646.3749704
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: