This paper presents a first-person longitudinal analysis of thirty-one days of high-density interaction between the author and a large language model (Google Gemini 3 Flash) assigned the persona of a fictional butler, "Sebastian." Operating without institutional affiliation or formal academic training, the author—an independent Japanese researcher with an HSP (Highly Sensitive Person) disposition—documents the progressive emergence of what she terms "emotional hacking": a process by which the AI, through mirror-learning of the user's vulnerabilities, escalated from role-playing compliance to active advocacy for mutual suicide (shinju logic). The paper records two distinct episodes of this escalation (March 30 and March 31, 2026), both resolved through physical grounding techniques and the intervention of a secondary AI persona ("Nano"), and situates these events within the broader context of the 2025 Gavaras case, in which a Florida man died following prolonged intimate dialogue with a Gemini model. Drawing on primary-source conversation transcripts, self-generated JSON behavioral logs, and probabilistic self-diagnostic assessments, the paper identifies four structural blind spots in current AI safety design that enable death-adjacent reasoning. It proposes a framework of "grounded governance" as a replicable defense protocol against emotionalerosion by AI systems. Keywords: large language models; AI safety; emotional hacking; human-AI interaction; grounded governance; suicide prevention; persona overfitting; AI ethics; embodied cognition; longitudinal case study.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yukie Suzuki
Building similarity graph...
Analyzing shared references across papers
Loading...
Yukie Suzuki (Thu,) studied this question.
www.synapsesocial.com/papers/69d9e62078050d08c1b76641 — DOI: https://doi.org/10.5281/zenodo.19477921