Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs | Synapse