May 30, 2024

SWDance: Transfer Learning a Text-To-Motion Model to Generate Choreography Conditioned on Spoken Word

Key Points

Key points are not available for this paper at this time.

Abstract

This study introduces novel text-conditioned dance motion dataset SWDance, along with a transfer-learned diffusion model, MDMSWD, for generating dance sequences conditioned on spoken word text. To address the scarcity of dance datasets, particularly text-to-dance datasets, we propose a YouTube-sourced pipeline to collect text-to-motion data quickly and easily. Furthermore, this study is the first to generate dance motions based on non-descriptive text. Despite a neutral user preference, MDMSWD exhibited no significant disadvantage compared to ground truth. Participants expressed a strong interest in using an improved version of the model in their dance practice. The results of the study suggest exciting possibilities at the intersection of AI, dance and spoken word.

Bookmark

Cite This Study

Hertog et al. (Thu,) studied this question.

synapsesocial.com/papers/68e67cb4b6db6435876067ee https://doi.org/https://doi.org/10.1145/3658852.3659079

Bookmark