Joint encoding of the waveform and speech recognition features using a transform codec | Synapse