This white paper introduces the central findings of The Bridge and the Backdoor: Language as a Dual-Use Interface to AI, a book-length treatment of a fundamental and previously unaddressed dimension of the AI alignment problem. The work draws on the convergent scholarship of formal pragmatics, cognitive psychology, and cognitive linguistics to establish that the natural language structures making sophisticated language processing systems capable are structurally identical to the mechanisms that make them steerable through natural language. The capability surface and the vulnerability surface of these systems are not merely overlapping. They are the same surface. A systematic taxonomy of psycholinguistic influence structures, organized across three primary categories of deletion patterns, distortion patterns, and generalization patterns, demonstrates that each technique exploits a processing mechanism constitutive of sophisticated language processing capability, engaged through parallel pathways in both human cognition and large language model architecture. The unified theoretical analysis produces three primary contributions: the identification of contextual alignment as a third dimension of the alignment problem distinct from the existing inner/outer framework; a demonstration that this framework converges with the mechanistic interpretability research program in ways that generate a joint research agenda for contextual alignment measurement and intervention; and a formal statement of the sophisticated user problem, showing that the competencies required to maximally leverage language model capabilities are identical to those required to produce maximal contextual alignment modification. The work argues that current alignment approaches are structurally insufficient at the architectural level where the most fundamental natural language steering operates, and that addressing the full depth of the problem requires psycholinguistic expertise alongside technical approaches and the development of trust infrastructure as a necessary complement to technical mechanism.
Daniel Midgett (Tue,) studied this question.