This cross-sectional study evaluates whether the performance of large language models on medical benchmarks reflects logical reasoning or pattern recognition.
Building similarity graph...
Analyzing shared references across papers
Loading...
Bedi et al. (Fri,) studied this question.
synapsesocial.com/papers/68c1bd2a54b1d3bfb60edf49 — DOI: https://doi.org/10.1001/jamanetworkopen.2025.26021
Suhana Bedi
Digital Science (United States)
Yixing Jiang
University of Maryland, Baltimore
Philip Chung
Stanford Medicine
JAMA Network Open
Stanford University
Stanford Medicine
Building similarity graph...
Analyzing shared references across papers
Loading...