Evaluating Robustness and Diversity in Visual Question Answering Using Multimodal Large Language Models | Synapse