We study hybrid search in text retrieval where lexical and semantic search are fused together with the intuition that the two are complementary in how they model relevance. In particular, we examine fusion by a convex combination of lexical and semantic scores, as well as the reciprocal rank fusion (RRF) method, and identify their advantages and potential pitfalls. Contrary to existing studies, we find RRF to be sensitive to its parameters; that the learning of a convex combination fusion is generally agnostic to the choice of score normalization; that convex combination outperforms RRF in in-domain and out-of-domain settings; and finally, that convex combination is sample efficient, requiring only a small set of training examples to tune its only parameter to a target domain.
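The two fusion methods named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the choice of min-max normalization, the convention that a document missing from one list contributes zero, and the defaults `alpha=0.5` and `k=60` are all assumptions made for this example.

```python
from typing import Dict

Scores = Dict[str, float]  # document id -> raw retrieval score


def min_max(scores: Scores) -> Scores:
    """Rescale scores to [0, 1]. One possible normalization (an assumption here)."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    span = hi - lo
    if span == 0:
        # All scores are equal, so no ordering information survives.
        return {doc: 0.0 for doc in scores}
    return {doc: (s - lo) / span for doc, s in scores.items()}


def convex_fusion(lexical: Scores, semantic: Scores, alpha: float = 0.5) -> Scores:
    """Fused score = alpha * semantic + (1 - alpha) * lexical, after normalization.

    Documents absent from one list are given 0 for that component (assumption).
    """
    lex, sem = min_max(lexical), min_max(semantic)
    docs = set(lex) | set(sem)
    return {
        doc: alpha * sem.get(doc, 0.0) + (1.0 - alpha) * lex.get(doc, 0.0)
        for doc in docs
    }


def rrf_fusion(lexical: Scores, semantic: Scores, k: float = 60.0) -> Scores:
    """Reciprocal rank fusion: sum of 1 / (k + rank) over the lists, ranks from 1.

    Only the rank of each document is used; the raw scores are discarded.
    """
    fused: Scores = {}
    for scores in (lexical, semantic):
        ranked = sorted(scores, key=lambda doc: scores[doc], reverse=True)
        for rank, doc in enumerate(ranked, start=1):
            fused[doc] = fused.get(doc, 0.0) + 1.0 / (k + rank)
    return fused
```

In this sketch `alpha` is the single weight that the convex combination would tune on a small set of training queries, while `k` is the RRF parameter whose setting the abstract reports the method to be sensitive to.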
Sebastian Bruch
Siyu Gai
Amir Ingber
ACM Transactions on Information Systems
University of California, Berkeley
Pine Technical and Community College
Bruch et al. studied this question.
synapsesocial.com/papers/6a025512e7b2554f3af60204 — DOI: https://doi.org/10.1145/3596512