FashionVLM - Fashion Captioning Using Pretrained Vision Transformer and Large Language Model | Synapse