The rapid proliferation of documents has made document intelligence increasingly critical across various industries. In recent years, large language models (LLMs) have dramatically transformed the field of document intelligence, allowing for more advanced and accurate document processing solutions. Despite these advancements, most existing surveys have failed to focus on these breakthroughs, instead concentrating on traditional methods and earlier machine learning techniques. This survey seeks to fill that gap by offering an in-depth analysis of approximately 300 papers published between 2021 and mid-2025, thus providing a comprehensive overview of the impact of LLMs in document intelligence. The key topics explored include retrieval-augmented generation (RAG), long context processing, and fine-tuning LLMs for document comprehension. Furthermore, the survey highlights essential datasets, practical applications, current challenges, and future research directions, offering critical insights for both researchers and industry practitioners looking to advance the field.
Building similarity graph...
Analyzing shared references across papers
Loading...
Wenjun Ke
Southeast University
Yifan Zheng
Edinburgh College
Youlan Li
Hanyang University
ACM transactions on office information systems
Southeast University
University of Macau
Metacomp Technologies (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Ke et al. (Tue,) studied this question.
synapsesocial.com/papers/68d45b2931b076d99fa5daf7 — DOI: https://doi.org/10.1145/3768156