⌘+K

December 1, 2023

Experimental Analysis of Large-Scale Learnable Vector Storage Compression

Key Points

Key points are not available for this paper at this time.

Abstract

Learnable embedding vector is one of the most important applications in machine learning, and is widely used in various database-related domains. However, the high dimensionality of sparse data in recommendation tasks and the huge volume of corpus in retrieval-related tasks lead to a large memory consumption of the embedding table, which poses a great challenge to the training and deployment of models. Recent research has proposed various methods to compress the embeddings at the cost of a slight decrease in model quality or the introduction of other overheads. Nevertheless, the relative performance of these methods remains unclear. Existing experimental comparisons only cover a subset of these methods and focus on limited metrics. In this paper, we perform a comprehensive comparative analysis and experimental evaluation of embedding compression. We introduce a new taxonomy that categorizes these techniques based on their characteristics and methodologies, and further develop a modular benchmarking framework that integrates 14 representative methods. Under a uniform test environment, our benchmark fairly evaluates each approach, presents their strengths and weaknesses under different memory budgets, and recommends the best method based on the use case. In addition to providing useful guidelines, our study also uncovers the limitations of current methods and suggests potential directions for future research.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Hailin Zhang

Xidian University

Penghao Zhao

Peking University

Xupeng Miao

Purdue University West Lafayette

Journals

Proceedings of the VLDB Endowment

Actions

Institutions

Carnegie Mellon University

Peking University

Beijing University of Posts and Telecommunications

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Zhang et al. (Fri,) studied this question.

synapsesocial.com/papers/6a0f97275725bbd5cc5fe37e — DOI: https://doi.org/10.14778/3636218.3636234

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Feature hashing for large scale multitask learning· 2009 · 937 citations
SlimDB· 2017 · 114 citations
Cardinality estimation of approximate substring queries using deep learning· 2022 · 12 citations
LOGER: A Learned Optimizer Towards Generating Efficient and Robust Query Execution Plans· 2023 · 45 citations
Approximate nearest neighbor algorithm based on navigable small world graphs· 2013 · 399 citations

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Feature hashing for large scale multitask learning· 2009 · 937 citations
SlimDB· 2017 · 114 citations
Cardinality estimation of approximate substring queries using deep learning· 2022 · 12 citations
LOGER: A Learned Optimizer Towards Generating Efficient and Robust Query Execution Plans· 2023 · 45 citations
Approximate nearest neighbor algorithm based on navigable small world graphs· 2013 · 399 citations

Experimental Analysis of Large-Scale Learnable Vector Storage Compression

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider