Lifelong learning agents require the ability to detect, without supervision, when the world they have been modelinghas changed—aboundarywedenoteB3(segmentation). Apriortoy‐scaleresearchprogrammappedfivemechanistically distinct failures of unsupervised segmentation in 3‐state synthetic environments, converging on the diagnosisthat a small worldstarves the regime‐changesignal of degrees of freedom. ThepresentpaperreportsthatB3crossescleanly in rich semantic environments when the segmentation detector operates on the cluster of correlated prediction failures produced by a generative world model, rather than on a scalar surprise signal.
Rahul Chouhan (Mon,) studied this question.