What question did this study set out to answer?

The research aims to develop a framework, COGITAO, for evaluating compositional and systematic generalization in machine learning models.

June 14, 2026Open Access

COGITAO : a procedural and object-centric framework to evaluate compositional and systematic generalization

Key Points

The research aims to develop a framework, COGITAO, for evaluating compositional and systematic generalization in machine learning models.
Introduced a modulable data-generation framework called COGITAO for object-centric domains.
Created rule-based tasks using a set of 28 interoperable transformations in grid-based environments.
Released benchmark datasets and baseline results from various state-of-the-art architectures.
Despite high performance within specific tasks, models struggle to generalize to new combinations of known elements.
Benchmark datasets reveal the challenges in achieving effective compositional and systematic generalization.
COGITAO enables the creation of millions of unique task rules, significantly enhancing testing capabilities.

Abstract

The ability to compose learned concepts and apply them in novel settings is key to human intelligence, but remains a key challenge in state-of-the-art machine learning models. To address this issue, we introduce COGITAO, a modulable datageneration framework to evaluate compositional and systematic generalization in object-centric domains. Drawing inspiration from ARC-AGI’s environment and problem-setting, COGITAO constructs rule-based tasks to be solved by applying a set of transformations to objects in grid-based environments. It supports composition over a set of 28 interoperable transformations, at adjustable composition-depth, along with extensive control over grid parametrization and object properties. This flexibility enables creating millions of unique task rules – surpassing existing datasets by several orders of magnitude – across a broad range of difficulties, while allowing virtually unlimited sample generation per rule. Alongside open-sourcing our flexible data-generation framework, we release benchmark datasets and provide baseline results with several SOTA architectures that incorporate inductive biases well-suited for compositionality, such as diffusion-based Transformers (LLaDA) or recurrent Transformers with Adaptive Computation Time. Despite strong in-domain performance, these models consistently fail to generalize to novel combinations of familiar elements – highlighting a persistent challenge in compositional and systematic generalization, which COGITAO allows to precisely characterize.

Bookmark

View Full Paper

Bookmark

View Full Paper

COGITAO : a procedural and object-centric framework to evaluate compositional and systematic generalization

Key Points

Abstract

Cite This Study