Localizing Before Answering: A Benchmark for Grounded Medical Visual Question Answering | Synapse