Temporal Feature Prediction in Audio–Visual Deepfake Detection | Synapse