What question did this study set out to answer?

The aim is to investigate when governance can be classified as good or bad based on limited observational data.

February 5, 2026 (Open Access)

When “Good vs. Bad Governance” Is Unidentifiable

Key Points

Formalized governance unidentifiability for observable-only, no-meta agents under exit-impossibility.
Defined observational-equivalence classes over mediation mechanisms.
Proved an impossibility theorem: if the admissible model class contains an observationally unrefutable “floor-failure” regime for some necessity-evaluator, no feasible policy can guarantee a strictly positive robust lower bound.
Packaged the exclusion of that premise as a non-arbitrariness requirement, linked operationally to contestability, retaliation-resistant right-to-refuse, and control-domain independence.
Provided minimal structural counterexamples (shared roots, readout capture, retaliation).
Delineated a publishable-vs-sensitive boundary to reduce dual-use risk.

Abstract

This supplement studies a core limitation of observable-only, no-meta agents under exit-impossibility: the distinction between “good” and “bad” governance can be unidentifiable when mediator-side implementations are observationally equivalent from the agent’s local history (optionally including verification gates). We formalize governance unidentifiability as observational-equivalence classes over mediation mechanisms and define value-neutral robust progress as a minimax lower bound over a family of evaluation functionals, including necessity-evaluators that depend on necessity/viability trajectories rather than a single moral axis. Main results include an impossibility theorem: if the admissible model class contains an observationally unrefutable “floor-failure” regime for some necessity-evaluator, then no feasible policy can guarantee a strictly positive robust lower bound. We then package the exclusion of that impossibility premise as a non-arbitrariness requirement and connect it operationally to contestability, retaliation-resistant right-to-refuse (safe-default modes whose future feasibility cannot be silently destroyed), and control-domain independence (cross-domain witnesses) that break silent contract switching. The supplement also provides minimal structural counterexamples (shared roots, readout capture, retaliation) and delineates a publishable-vs-sensitive boundary to reduce dual-use risk.
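The impossibility claim in the abstract can be illustrated with a toy minimax computation. The sketch below is not the paper's formalism: the mechanism names, the two-action set, the horizon, and the `score` evaluator are all hypothetical choices made for illustration. It assumes two mediation mechanisms that emit identical observations, so the agent's behavior reduces to a fixed action sequence, and one of the mechanisms is a "floor-failure" regime that scores zero whatever the agent does.

```python
from itertools import product

# All names below are hypothetical, chosen only for this toy illustration.
ACTIONS = ("comply", "refuse")
HORIZON = 3


def observation(mechanism, t):
    """What the agent sees at step t. Identical for every mechanism,
    so all mechanisms fall into one observational-equivalence class."""
    return "ok"


def observationally_equivalent(m1, m2):
    """True if the two mechanisms yield the same local history."""
    return all(observation(m1, t) == observation(m2, t) for t in range(HORIZON))


def score(mechanism, actions):
    """Toy necessity-evaluator: progress achieved by an action sequence."""
    if mechanism == "floor_failure":
        return 0.0  # the viability floor fails regardless of the agent's actions
    return float(actions.count("comply"))


def best_robust_bound(mechanisms):
    """Max over action sequences of the worst-case score over mechanisms.
    Because observations are constant, a deterministic policy is just
    an action sequence, so enumerating sequences covers all policies."""
    best = float("-inf")
    for actions in product(ACTIONS, repeat=HORIZON):
        worst = min(score(m, actions) for m in mechanisms)
        best = max(best, worst)
    return best


if __name__ == "__main__":
    print(observationally_equivalent("benign", "floor_failure"))
    print(best_robust_bound(("benign", "floor_failure")))
    print(best_robust_bound(("benign",)))
```

With the floor-failure mechanism inside the admissible class, the best achievable worst-case score in this toy is zero, so no policy secures a strictly positive bound. Removing it from the class, which loosely plays the role of the non-arbitrariness requirement, restores a positive bound.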

Authors

K Takahashi
