Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing