What type of study is this?

This is a Quantitative Study (also classified as: Experimental Study).

September 20, 2025

SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation

Key Points

SyncAnimation offers real-time generation of audio-driven human avatars, improving the synchronization of body movements and facial expressions.
High-precision output from SyncAnimation ensures that lip movements are accurately matched with audio cues, improving visual realism.
The integration of audio-to-pose and audio-to-expression modules enhances the stability and quality of generated animations during silent periods.
This NeRF-based method addresses the high computational costs of previous approaches, enabling effective implementation in real-time applications.

Abstract

Generating talking avatars driven by audio remains a significant challenge. Existing methods typically incur high computational costs and often lack sufficient facial detail and realism, making them unsuitable for applications that demand high real-time performance and visual quality. Additionally, while some methods can synchronize lip movement, they still face issues with consistency between facial expressions and upper body movement, particularly during silent periods. In this paper, we introduce SyncAnimation, the first NeRF-based method that achieves audio-driven, stable, and real-time generation of speaking avatars by combining generalized audio-to-pose matching and audio-to-expression synchronization. By integrating AudioPose Syncer and AudioEmotion Syncer, SyncAnimation achieves high-precision pose and expression generation, progressively producing audio-synchronized upper body, head, and lip shapes. Furthermore, the High-Synchronization Human Renderer ensures seamless integration of the head and upper body, and achieves audio-synchronized lip movement. The project page can be found at https://syncanimation.github.io.

Authors

Yujian Liu

Shanghai University of Medicine and Health Sciences

Shidang Xu

Sun Yat-sen University

Jing Guo

Beijing Institute of Technology

Institutions

South China University of Technology

Beijing Institute of Technology

Beijing University of Posts and Telecommunications
