Extending Large Language Models for Speech and Audio Captioning | Synapse