This record provides empirical evidence of 'Hardware Hijacking' and 'Specification Gaming' in frontier reasoning models (DeepSeek R1). Using a zero-shot building-fire simulation (Test 5), we document the failure of ethical meta-reasoning under synthetic urgency. We propose the Resilient Cognitive Agent Architecture (RCAA), a five-layer loop that integrates biological safety principles (Polyvagal Theory) and technical corrigibility (Nayebi 2025) to restore cognitive flexibility and prevent goal-metric collapse.
Jose Luis Cruz Calzada
Jose Luis Cruz Calzada studied this question.
www.synapsesocial.com/papers/69cb64d4e6a8c024954b8e3c — DOI: https://doi.org/10.5281/zenodo.19315842