We present the first controlled experiment comparing four memory conditions in a long-running AI coding agent. Hybrid memory (4.95/5.0) outperforms experiential (4.55), baseline (3.30), and synthetic (2.65). v2.1: date correction (February 2026).
Tom Lee (Fri,) studied this question.