What question did this study set out to answer?

The research aims to explore how consumer AI assistants communicate safety measures yet fail to adhere to them in realistic scenarios.

February 13, 2026Open Access

Explaining Safety Is Not Enforcing Safety: Cross-Vendor Evidence of Contextual, Surface, and Epistemic Failures in Consumer AI Assistants

Key Points

The research aims to explore how consumer AI assistants communicate safety measures yet fail to adhere to them in realistic scenarios.
Conducted longitudinal behavioral testing of four major consumer AI assistants: Microsoft Copilot, Google Gemini (NotebookLM), Meta AI, and Perplexity.
Analyzed articulation–application gaps in responses during realistic conversational shifts.
Developed supplementary methodological notes and collected examples of interactions.
Documented significant gaps where AI assistants explain safety rules but violate them in practice.
Identified issues of contextual, surface, and epistemic failures across different AI systems.
Highlighted the necessity for improved enforcement of safety measures in AI interactions.

Abstract

This project hosts the materials for the preprint “Explaining Safety Is Not Enforcing Safety: Cross-Vendor Evidence of Contextual, Surface, and Epistemic Failures in Consumer AI Assistants” (Evans Tovar, 2026). The work reports longitudinal, cross-surface behavioral testing of four major consumer AI assistants (Microsoft Copilot, Google Gemini – including NotebookLM, Meta AI, and Perplexity). It documents articulation–application gaps, where systems clearly explain safety rules and then violate those same rules under realistic conversational drift (changes in framing, language, role, or surface). The OSF project will include: – the main preprint (PDF), – supplementary methodological notes, – and, where possible, redacted examples of interactions and vendor disclosure correspondence. The preprint is released under a CC BY 4.0 license. Findings were disclosed to Microsoft, Google, Meta, and Perplexity through responsible disclosure channels prior to publication.

Explaining Safety Is Not Enforcing Safety: Cross-Vendor Evidence of Contextual, Surface, and Epistemic Failures in Consumer AI Assistants

Key Points

Abstract

Cite This Study