Rethinking Overlooked Aspects in Vision-Language Models | Synapse