A Multimodal, Multi-Task Adapting Framework for Video Action Recognition | Synapse