End-to-end target speaker speech recognition with voice activity detection fusion | Synapse