What question did this study set out to answer?

The aim is to systematically review the development and efficiency enhancements of Large Language Models and their underlying architectures.

May 20, 2026Open Access

Architectural Evolution of Large Language Models: A Systematic Survey on Attention Mechanisms, Efficiency, Adaptation, and Emerging Paradigms

Key Points

The aim is to systematically review the development and efficiency enhancements of Large Language Models and their underlying architectures.
Conducted a systematic survey of architectural advancements in Large Language Models.
Analyzed various mechanisms including efficient attention and retrieval-augmented generation.
Explored emerging paradigms like mixture-of-experts and multimodal architectures.
Identified significant advancements in attention mechanisms that enhance processing efficiency.
Found that multimodal LLMs improve performance across diverse data types.
Noted the rise of new paradigms, such as Retrieval-Augmented Generation, that optimize inference.

Abstract

This survey paper reviews the architectural evolution of Large Language Models (LLMs), including Transformer architectures, efficient attention mechanisms, LoRA, Retrieval-Augmented Generation (RAG), Mixture-of-Experts (MoE), multimodal LLMs, inference optimization, and emerging paradigms in modern generative AI systems.

Read Full Paperexternally

Perguntar à IA

Bookmark

View Full Paper

Cite This Study

Gadara Kriskumar (Mon,) studied this question.

synapsesocial.com/papers/6a0d5122f03e14405aa9d785 https://doi.org/https://doi.org/10.5281/zenodo.20265846

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Perguntar à IA

Bookmark

View Full Paper