August 16, 2025Open Access

Diffusion-based Large Language Models Survey

Puntos clave

Diffusion-based large language models enhance parallel generation and controllability across multiple modalities, indicating a shift in generative model design.
Key advancements include autoregressive-diffusion unification and adaptive correction sampling, improving efficiency and output quality.
The survey organizes current methods by sampling strategy, noise schedule, guidance type, and examines fine-tuning in recent models.
Identifying challenges like scalable alignment strategies and robust evaluation protocols highlights the need for future research in the field.

Resumen

Diffusion-based large language models (DLLMs) have emerged as a promising alternative to traditional autoregressive architectures, notably enhancing parallel generation, controllability, and robustness across multiple modalities. Originally developed from continuous diffusion methods in computer vision, recent adaptations of DLLMs have tailored discrete diffusion processes through absorbing-state kernels, latent projections, and hybrid architectures. This survey reviews recent developments in DLLMs, beginning with their foundational concepts, including DDPM, DDIM, and their early discrete adaptations, such as mask-based, continuous-embedding, and hybrid models. We organize current methods by sampling strategy, guidance type, noise schedule, and temporal conditioning, and analyzes their efficiency, output quality, and fine-tuning. The paper also highlights key advancements: autoregressive-diffusion unification through hyperschedules, adaptive correction sampling, and efficient caching mechanisms to enhance computational performance. Besides, it explores emerging applications, such as natural language tasks, multimodal generation, and reasoning-intensive domains... These demonstrate the versatility of DLLMs. Furthermore, the paper identifies critical challenges, including adaptive sampling, scalable alignment strategies, deeper integration with pretrained language models, graph-based diffusion frameworks, and robust evaluation protocols. Finally, the paper proposes directions that could define future research in diffusion-based sequence generation.

Leer artículo completoexternamente

Preguntar a la IA

Me gusta

Guardar

Ver artículo completo