Localizing Visual Sounds the Hard Way | Synapse