Evaluation of the performance of large language models in determining RADS scores from radiology reports | Synapse