What question did this study set out to answer?

The aim is to explore how to inject new operational capabilities into large language models without retraining.

April 16, 2026 | Open Access

Capability Injection Without Retraining

Key Points

The study explores how to inject new operational capabilities into large language models without retraining.
It uses the STAR (Structured Tree with Active Retrieval) persistent memory framework.
The test system is a 4-billion-parameter Gemma 4 model running on a consumer smartphone.
It investigates how epistemic framing and authorship fidelity affect capability injection.
Injected capabilities can be reliably adopted and autonomously applied.
The model executed a custom device control capability from a vague natural language instruction.
The model correctly described the intended behavior of a web search capability, although the pipeline did not execute completely.

Abstract

Previous attempts to extend the operational capabilities of large language models through context-based knowledge delivery have produced inconsistent results. Systems fail to adopt injected capabilities reliably, treat injected knowledge as external reference material rather than actionable self-knowledge, or lose the information entirely when context windows are exceeded. We identify two previously unrecognized factors as the primary determinants of injection success: epistemic framing, the structural positioning of knowledge within a trusted first-person memory space; and authorship fidelity, the degree to which injected content matches the model’s own generative voice and style. Using the STAR (Structured Tree with Active Retrieval) persistent memory framework deployed on a 4-billion-parameter Gemma 4 (E4B variant) model running locally on a consumer smartphone, we demonstrate that capabilities entirely absent from a model’s training data can be reliably injected, adopted autonomously, and applied without explicit prompting. The model used a custom device control capability unprompted in response to a vague natural language instruction, and it correctly understood and described the intended behavior of a web search capability even though the pipeline did not execute completely. These findings suggest that the boundary between trained and injected knowledge is substantially more permeable than previously assumed, and that structured persistent memory systems represent a practical, infrastructure-free alternative to fine-tuning for capability extension across models of any scale.
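To make the idea of epistemic framing concrete, the sketch below contrasts two ways of placing the same retrieved note into a prompt: as first-person memory and as external reference material. It is an illustration only and not the STAR implementation. The `MemoryNode` class, the keyword-overlap retrieval, the header wording, the flashlight example, and the `[[device:...]]` tag syntax are all hypothetical assumptions introduced here.

```python
from dataclasses import dataclass, field


@dataclass
class MemoryNode:
    """One node in a hypothetical memory tree (not the actual STAR schema)."""
    title: str
    note: str = ""  # first-person text, ideally written in the model's own voice
    children: list["MemoryNode"] = field(default_factory=list)


def retrieve(node: MemoryNode, query: str) -> list[MemoryNode]:
    """Depth-first search for nodes whose title or note shares a word with the query."""
    words = set(query.lower().split())
    text = set(f"{node.title} {node.note}".lower().split())
    hits = [node] if node.note and words & text else []
    for child in node.children:
        hits.extend(retrieve(child, query))
    return hits


def frame_as_self_knowledge(nodes: list[MemoryNode]) -> str:
    """Present notes as the model's own memory (assumed first-person framing)."""
    return "\n".join(["My own notes from earlier sessions:"] + [f"- {n.note}" for n in nodes])


def frame_as_reference(nodes: list[MemoryNode]) -> str:
    """Present the same notes as third-party documentation, for comparison."""
    return "\n".join(["External documentation:"] + [f"- {n.note}" for n in nodes])


# Hypothetical capability note; the tag syntax is invented for this example.
root = MemoryNode("root", children=[
    MemoryNode("capabilities", children=[
        MemoryNode("flashlight",
                   "I can toggle the flashlight by emitting [[device:flashlight on]]."),
    ]),
])

hits = retrieve(root, "turn on the flashlight")
prompt = frame_as_self_knowledge(hits)
print(prompt)
```

Both framing functions carry identical content, so any difference in how a model treats the two prompts would come from positioning and voice alone, which is the variable the abstract describes.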
