August 23, 2024Open Access

LIA: Latent Image Animator

Key Points

Key points are not available for this paper at this time.

Abstract

Previous animation techniques mainly focus on leveraging explicit structure representations (e.g., meshes or keypoints) for transferring motion from driving videos to source images.However, such methods are challenged with large appearance variations between source and driving data, as well as require complex additional modules to respectively model appearance and motion.Towards addressing these issues, we introduce the Latent Image Animator (LIA), streamlined to animate high-resolution images.LIA is designed as a simple autoencoder that does not rely on explicit representations.Motion transfer in the pixel space is modeled as linear navigation of motion codes in the latent space.Specifically such navigation is represented as an orthogonal motion dictionary learned in a self-supervised manner based on proposed Linear Motion Decomposition (LMD).Extensive experimental results demonstrate that LIA outperforms state-of-the-art on VoxCeleb, TaichiHD, and TED-talk datasets with respect to video quality and spatiotemporal consistency.In addition LIA is well equipped for zeroshot high-resolution image animation.Code, models and demo videos are available at https://wyhsirius.github.io/LIA-project/.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Yaohui Wang

Beijing Academy of Artificial Intelligence

Di Yang

National University of Defense Technology

François Brémond

Institut national de recherche en sciences et technologies du numérique

Journals

IEEE Transactions on Pattern Analysis and Machine Intelligence

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Wang et al. (Fri,) studied this question.

synapsesocial.com/papers/68e5b146b6db64358754b049 — DOI: https://doi.org/10.1109/tpami.2024.3449075

Also consider

Synapse has enriched 2 closely related papers on similar clinical questions. Consider them for comparative context:

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation· 2024 · 6 citations
Imagen Video: High Definition Video Generation with Diffusion Models· 2022 · 346 citations

LIA: Latent Image Animator

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider