What question did this study set out to answer?

This research aims to improve the accuracy of answers in retrieval-augmented generation by integrating semantic keywords into the retrieval process.

June 3, 2026

Enhanced RAG retrieval via semantic keywords and dynamic expansion

Key Points

Developed an enhanced method that generates semantic keywords from document chunks using a large language model.
Incorporated both keyword and chunk embeddings into the retrieval process to assess similarities.
Implemented a dynamic expansion mechanism to extract additional relevant keywords in cases of low recall.
The retrieval method showed significant improvement in answer accuracy when semantic keywords were included.
Dynamic expansion increased the relevance of retrieved content by aligning queries more closely with document chunks.
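
The points above say that keyword and chunk embeddings are both used to assess similarity, but they do not say how the two are combined. The sketch below is one plausible reading, not the authors' formula: it assumes cosine similarity, takes the best-matching keyword per chunk, and mixes the two scores with a hypothetical weight `alpha`. The names `combined_score`, `retrieve`, and the index layout are illustrative assumptions.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def combined_score(query_vec, chunk_vec, keyword_vecs, alpha=0.5):
    """Weighted mix of chunk similarity and best keyword similarity.

    alpha is an assumed mixing weight; the source does not specify one.
    """
    chunk_sim = cosine(query_vec, chunk_vec)
    keyword_sim = max((cosine(query_vec, k) for k in keyword_vecs), default=0.0)
    return alpha * chunk_sim + (1 - alpha) * keyword_sim

def retrieve(query_vec, index, top_k=3, alpha=0.5):
    """Return the ids of the top_k chunks by combined score.

    index is a list of dicts with keys "id", "chunk_vec", "keyword_vecs"
    (a hypothetical layout chosen for this example).
    """
    scored = [
        (combined_score(query_vec, e["chunk_vec"], e["keyword_vecs"], alpha), e["id"])
        for e in index
    ]
    scored.sort(reverse=True)
    return [chunk_id for _, chunk_id in scored[:top_k]]
```

With `alpha=1.0` this reduces to embedding-only retrieval over chunks, which makes it easy to compare rankings with and without the keyword signal.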

Abstract

An enhanced retrieval method for Retrieval-Augmented Generation (RAG) is presented, aimed at improving answer accuracy through semantic keyword integration. For each document chunk, a large language model (LLM) generates representative semantic keywords that are converted into embeddings and incorporated into the retrieval process along with the embeddings of the original chunk. During retrieval, both keyword and chunk similarities are considered. In cases of low recall, an auxiliary step allows the LLM to scan documents sequentially and extract additional relevant keywords. This dynamic expansion mechanism enhances the alignment between queries and relevant content, addressing limitations of traditional embedding-only retrieval approaches.
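
The dynamic expansion step can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: low recall is approximated here by the best retrieval score falling below a hypothetical `threshold`, and the LLM call is replaced by a placeholder `toy_extract` that simply returns query words found in the chunk. The names `expand_keywords`, `toy_extract`, and the dictionary layouts are invented for the example.

```python
def expand_keywords(query, chunks, keywords, extract, best_score, threshold=0.5):
    """Add query-relevant keywords to chunks when retrieval looks weak.

    chunks:   dict of chunk_id -> chunk text
    keywords: dict of chunk_id -> list of keywords (updated in place)
    extract:  callable (query, text) -> list of keywords; stands in for the LLM
    Returns the number of keywords added.
    """
    if best_score >= threshold:
        return 0  # retrieval judged good enough, no expansion needed
    added = 0
    for chunk_id, text in chunks.items():  # sequential scan over the documents
        for kw in extract(query, text):
            existing = keywords.setdefault(chunk_id, [])
            if kw not in existing:
                existing.append(kw)
                added += 1
    return added

def toy_extract(query, text):
    """Placeholder for the LLM: query words that literally occur in the chunk."""
    words = set(text.lower().split())
    return [w for w in query.lower().split() if w in words]
```

In a full pipeline the newly added keywords would then be embedded and the query retried against the updated index.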

Cite This Study

Kai et al. studied this question.

synapsesocial.com/papers/6a1fc42cdee9eb8c0dce5bce https://doi.org/10.1049/icp.2026.1965