A New Paradigm for Human Motion Generation Based on Cross-Modal Nested Alignment | Synapse