Vision-Language Model for Generating Textual Descriptions From Clinical Images: Model Development and Validation Study