From CNNs to Transformers in Multimodal Human Action Recognition: A Survey