Abstract— The deployment of Large Language Models (LLMs) in enterprise environments is currently bottlenecked by extreme resource overhead, variable API pricing, and a lack of technological sovereignty. This paper presents Delentia OS, an intent-centric AI operating system architecture designed to run entirely on local edge consumer hardware. Driven by the JITNA (RFC-001) protocol and an underlying mathematical governance framework (F = DI * A), Delentia OS utilizes a frozen 8-billion parameter base model coupled with a dynamic Low-Rank Adaptation (LoRA) swapping scheduler. Key Architectural Highlights: • Sub-12ms Dynamic LoRA Swapping: Hot-swaps four specialized cognitive adapters (Router, Guardian, Executor, Scribe) within local VRAM in under 12 milliseconds (actual avg. 11. 2ms). • Differential Memory Retention (Delta Engine): Implements ALGO-41 to compress long-term semantic memory, reducing context retrieval VRAM footprint by 74. 2% and lowering repeated query inference costs by up to 99. 4%. • Deterministic Mathematical Governance: Regulates autonomous agentic workflows via the FDIA authorization gate check (F = DI * A), guaranteeing absolute transaction-level safety. Empirical Verification & Benchmarks: • Zero-Crash Invariant Verification: Evaluated via property-based testing across 205, 999 regression examples with zero software crashes (0. 00% crash rate). • Absolute Execution Precision: Achieves a 0. 00% syntax error rate in structured JSON generation and automated tool-calling workflows. Keywords— Intent-Centric AI, Edge Computing, Dynamic LoRA Swapping, VRAM Optimization, Cognitive Operating System, SignedAI Consensus, Technological Sovereignty, JITNA Protocol, Small Language Models (SLM).
Ittirit Saengow (Fri,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: