A Unified Framework for Human Motion Generation with Multimodal Inputs | Synapse