This survey reviews the architectural evolution of Large Language Models (LLMs), covering Transformer architectures, efficient attention mechanisms, Low-Rank Adaptation (LoRA), Retrieval-Augmented Generation (RAG), Mixture-of-Experts (MoE), multimodal LLMs, inference optimization, and emerging paradigms in modern generative AI systems.
Gadara Kriskumar (Mon,) studied this question.