Visually rich document understanding (VrDU) models rely on tightly coupled textual, layout, and visual representations. In real-world deployments, these models must continuously adapt to new document domains over time. However, naïve sequential fine-tuning leads to severe catastrophic forgetting due to shared parameters and strong cross-task interference. Existing continual learning approaches either constrain parameter updates, preserve output distributions, or uniformly suppress gradient directions associated with previous tasks. While effective in limited settings, these strategies fail to balance stability and plasticity in large multimodal transformers. We propose AMD-Proj, an adaptive memory-driven selective gradient projection framework for continual learning in document understanding. It models task knowledge using specific gradient subspaces and adaptively modulates incoming gradients based on their alignment with this memory, selectively blocking interfering directions while reinforcing reusable ones. An efficient truncated SVD mechanism with online subspace merging ensures bounded memory usage and scalability to large transformer-based architectures. We evaluate AMD-Proj on four VrDU benchmarks (FUNSD, SROIE, CORD, and BuDDIE) under a task-incremental learning setting using LayoutLMv2 and LayoutLMv3 backbones. Results show that AMD-Proj reduces catastrophic forgetting and improves F1-based stability over EWC, GPM, LwF, OWM, CUBER, TRGP and parameter-efficient fine-tuning methods. Extensive mechanistic analyses, including gradient spectrum decomposition and layer-wise reuse versus block dynamics, provide insight into how selective gradient projection controls optimization geometry during continual adaptation. These findings establish selective gradient projection as a principled and interpretable approach for continual learning in visually rich document understanding.
Building similarity graph...
Analyzing shared references across papers
Loading...
Abdellatif Sassioui
OCP Group (Morocco)
Yasser Elouargui
OCP Group (Morocco)
Mohamed El Kamili
University of Hassan II Casablanca
Technologies
University of Hassan II Casablanca
OCP Group (Morocco)
Building similarity graph...
Analyzing shared references across papers
Loading...
Sassioui et al. (Thu,) studied this question.
synapsesocial.com/papers/69ec5a8888ba6daa22dac1fe — DOI: https://doi.org/10.3390/technologies14050250
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: