What question did this study set out to answer?

The goal is to develop a framework that allows AI agents to continually refine their context without altering their underlying model.

March 18, 2026Open Access

Towards Self-Evolving Agents: A Dual-Process Framework for Continual Context Refinement

Key Points

The goal is to develop a framework that allows AI agents to continually refine their context without altering their underlying model.
Introduced the Dual-Process Agent (DPA) framework for interaction episodes.
Utilized a fast System 1 for quick responses and a slow System 2 for reflection.
Maintained bulletized memory entries with statistics to prevent degradation.
Employed a curator gate to filter out unhelpful memory updates.
Conducted experiments on six diverse benchmarks to evaluate performance.
DPA consistently outperformed vanilla prompting and competitive baselines.
Achieved best overall performance across multiple reasoning and knowledge-intensive tasks.
Showcased effective continual context refinement in AI interactions.

Abstract

Large Language Models (LLMs) have become essential for interactive AI systems, yet they remain fundamentally static after deployment: they cannot update their parameters from interaction feedback and often repeat the same mistakes across long interaction streams. We propose Dual-Process Agent (DPA), a framework for continual context refinement that enables learning without modifying a frozen model backbone. Inspired by dual-process theory from cognitive science, DPA decomposes each interaction episode into two complementary processes: a fast System 1 that retrieves compact, relevant context from an explicit long-term memory and generates responses, and a slow System 2 that reflects on outcomes and writes curated updates back into memory. To prevent memory degradation over extended interactions, DPA maintains bulletized memory entries with utility statistics and employs a conservative curator gate that filters generic, redundant, or conflicting insertions. Experiments on six diverse benchmarks demonstrate that DPA consistently outperforms vanilla prompting and competitive baselines on both GPT-5.1 and Llama-3.1-8B backbones, achieving the best overall performance across multiple reasoning and knowledge-intensive tasks.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Liangyu Teng

Wei Ni

Liang Song

Journals

Electronics

Actions

Institutions

Fudan University

China State Construction Engineering (China)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Towards Self-Evolving Agents: A Dual-Process Framework for Continual Context Refinement

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider