Enhancing classroom behavior analysis with multimodal data: a cross-attention fusion network approach | Synapse