What question did this study set out to answer?

This work aims to understand and unlock in-context learning (ICL) mechanisms in large language models across various modalities.

June 17, 2026 (Open Access)

Unlocking In-Context Learning for Natural Datasets Across Modalities

Key Points

Systematic analysis of token repetitions in training data sequences and their effect on ICL.
Training dynamics detailed for autoregressive models with varying task difficulty.
Application of novel insights to visual datasets for few-shot classification and real-world examples.
Exact token repetitions in training sequences identified as an important factor for ICL, improving stability and reducing transiency of ICL performance.
Confirmed generalizability of insights through large-scale object classification and complex EEG classification tasks.
Enhanced ICL performance observed across multiple natural datasets and modalities.

Abstract

Large Language Models (LLMs) exhibit In-Context Learning (ICL), which enables the model to perform new tasks conditioning only on the examples provided in the context without updating the model’s weights. While ICL offers fast adaptation across natural language tasks and domains, its emergence is less straightforward for modalities beyond text. In this work, we systematically uncover properties present in LLMs that support the emergence of ICL for autoregressive models and various modalities by promoting the learning of the mechanisms needed for ICL. We identify exact token repetitions in the training data sequences as an important factor for ICL. Such repetitions further improve stability and reduce transiency in ICL performance. We analyse in detail the training dynamics of such data sequences and explain how token repetitions enhance the ICL learning mechanisms. Moreover, we emphasise the importance of the training task difficulty for the emergence of ICL. Finally, by applying our novel insights on ICL emergence, we unlock ICL capabilities across various visual datasets used for few-shot classification, and confirm the generalisability of our insights to much harder real-world examples of large-scale object classification, and a more challenging EEG classification task. Code is available at https://github.com/jelenab98/unlocking_icl
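To make the idea of "exact token repetitions in the training data sequences" concrete, the following is a minimal sketch of how a few-shot training sequence with an optional exact repetition might be assembled. It is an illustration under stated assumptions, not the authors' pipeline: the function name `build_icl_sequence`, the `repeat_prob` parameter, and the choice to repeat a context item as the query are all hypothetical, and the linked repository remains the reference for the actual method.

```python
import random


def build_icl_sequence(examples_by_class, n_way=2, n_shot=2,
                       repeat_prob=0.5, rng=None):
    """Assemble one few-shot sequence: a shuffled context plus a query.

    Hypothetical sketch: with probability `repeat_prob` the query is an
    exact copy of one context item (an exact token repetition); otherwise
    it is a held-out item from one of the context classes.
    Each sampled class must hold at least `n_shot + 1` items.
    """
    if rng is None:
        rng = random.Random()

    # Pick the classes that appear in this sequence.
    classes = rng.sample(sorted(examples_by_class), n_way)

    context, held_out = [], {}
    for label in classes:
        items = rng.sample(examples_by_class[label], n_shot + 1)
        held_out[label] = items[-1]            # reserved, never in context
        context.extend((x, label) for x in items[:n_shot])
    rng.shuffle(context)

    if rng.random() < repeat_prob:
        # Exact repetition: the query token already occurs in the context.
        query, target = rng.choice(context)
        repeated = True
    else:
        # No repetition: the query is unseen, only its class is in context.
        target = rng.choice(classes)
        query = held_out[target]
        repeated = False

    return {"context": context, "query": query,
            "target": target, "repeated": repeated}


if __name__ == "__main__":
    # Toy stand-ins for tokenised inputs (e.g. image or EEG tokens).
    data = {c: [f"{c}{i}" for i in range(4)] for c in "abc"}
    print(build_icl_sequence(data, rng=random.Random(0)))
```

In this sketch, raising `repeat_prob` increases the share of training sequences in which the answer can be found by matching the query to an identical context token, which is one simple way to vary the amount of exact repetition a model sees during training.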

Cite This Study

Bratulić et al. studied this question.

synapsesocial.com/papers/6a323cc9d50b63ecad206c8f
https://doi.org/10.1007/s11263-026-02913-0
