What type of study is this?

This is a Literature Review study.

September 10, 2025

Human Motion Video Generation: A Survey

Key Points

This survey offers a comprehensive overview of human motion video generation, covering over ten sub-tasks.
Key phases of the generation process include input, motion planning, and refinement among others.
The analysis reviews significant advancements across three modalities: vision, text, and audio.
This resource aims to drive innovation in applications of digital humans and their integration into various fields.

Abstract

Human motion video generation has garnered significant research interest due to its broad applications, enabling innovations such as photorealistic singing heads or dynamic avatars that seamlessly dance to music. However, existing surveys in this field focus on individual methods, lacking a comprehensive overview of the entire generative process. This paper addresses this gap by providing an in-depth survey of human motion video generation, encompassing over ten sub-tasks, and detailing the five key phases of the generation process: input, motion planning, motion video generation, refinement, and output. Notably, this is the first survey that discusses the potential of large language models in enhancing human motion video generation. Our survey reviews the latest developments and technological trends in human motion video generation across three primary modalities: vision, text, and audio. By covering over two hundred papers, we offer a thorough overview of the field and highlight milestone works that have driven significant technological breakthroughs. Our goal for this survey is to unveil the prospects of human motion video generation and serve as a valuable resource for advancing the comprehensive applications of digital humans. A complete list of the models examined in this survey is available in Our Repository.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Haiwei Xue

Hong Kong University of Science and Technology

Xiangyang Luo

Peng Cheng Laboratory

Zhanghao Hu

Art Institute of Portland

Journals

IEEE Transactions on Pattern Analysis and Machine Intelligence

Actions

Institutions

Tsinghua University

University of Chinese Academy of Sciences

Fudan University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Human Motion Video Generation: A Survey

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider