Evaluation of large language models for diagnostic impression generation from brain MRI report findings: a multicenter benchmark and reader study | Synapse