Bridging vision and touch: advancing robotic interaction prediction with self-supervised multimodal learning | Synapse