From Sound to Sight: Audio-Visual Fusion and Deep Learning for Drone Detection | Synapse