Reasoning models have demonstrated remarkable progress in solving complex and logic-intensive tasks by generating extended Chain-of-Thoughts (CoTs) prior to arriving at a final answer. Yet, the emergence of this "slow-thinking" paradigm, with numerous tokens generated in sequence, inevitably introduces substantial computational overhead. To this end, it highlights an urgent need for effective acceleration. This survey aims to provide a comprehensive overview of recent advances in efficient reasoning. It categorizes existing works into three key directions: (1) shorter - compressing lengthy CoTs into concise yet effective reasoning chains; (2) smaller - developing compact language models with strong reasoning capabilities through techniques such as knowledge distillation, other model compression techniques, and reinforcement learning; and (3) faster - designing efficient decoding strategies to accelerate inference of reasoning models. A curated collection of papers discussed in this survey is available in our GitHub repository: https://github.com/fscdc/Awesome-Efficient-Reasoning-Models.
Building similarity graph...
Analyzing shared references across papers
Loading...
Sicheng Feng
Gongfan Fang
Xinyin Ma
Building similarity graph...
Analyzing shared references across papers
Loading...
Feng et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68f6196ee0bbbc94fac361b6 — DOI: https://doi.org/10.48550/arxiv.2504.10903