Key points are not available for this paper at this time.
Abstract Transformer-based language models have significantly advanced our understanding of meaning representation in the human brain. Prior research utilizing smaller models like BERT and GPT-2 suggests that “next-word prediction” is a computational principle shared between machines and humans. However, recent advancements in large language models (LLMs) have highlighted the effectiveness of instruction tuning beyond next-word prediction. It remains to be tested whether instruction tuning can further align the model with language processing in the human brain. In this study, we evaluated the self-attention of base and finetuned LLMs of different sizes against human eye movement and functional magnetic resonance imaging (fMRI) activity patterns during naturalistic reading. Our results reveal that increases in model size significantly enhance the alignment between LLMs and brain activity, whereas instruction tuning does not. These findings confirm a scaling law in LLMs’ brain-encoding performance and suggest that “instruction-following” may not mimic natural language comprehension in humans.
Building similarity graph...
Analyzing shared references across papers
Loading...
Changjiang Gao
Zhengwu Ma
Jiajun Chen
Hong Kong Polytechnic University
City University of Hong Kong
Nanjing University of Science and Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Gao et al. (Fri,) studied this question.
www.synapsesocial.com/papers/68e5c0e0b6db643587558276 — DOI: https://doi.org/10.1101/2024.08.15.608196
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: