Cross-Modal Learning with 3D Deformable Attention for Action Recognition | Synapse