Semi-supervised text-audio contrastive learning method using pseudo-text input | Synapse