Sample-efficient Audio-Visual Learning of Scene Acoustics | Synapse