What does this research mean for the field?

An inference-time negative-constraint framework, the AI Inversion Model, can mitigate systemic bias and extreme utilitarianism in medical AI by enforcing strict anti-moral boundaries and quantifying algorithmic good faith. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This research aims to address systemic biases in AI-driven medical decision-making by proposing an auditable framework.

May 29, 2026Open Access

The AI inversion model: a linear negative-constraint framework for auditable alignment in medical decision-making

Key Points

This research aims to address systemic biases in AI-driven medical decision-making by proposing an auditable framework.
Propose the AI Inversion Model using inference-time negative constraints.
Implement a Genesis anchor system for calibrating negative anchors based on legal and ethical standards.
Conduct surgical simulations to evaluate the model's efficacy in identifying biases.
The model detects utilitarian biases with a heuristic variance threshold of 10%.
It provides a framework for operationalizing the concept of 'functional compassion'.
While not a full solution, it begins to address the black-box dilemma in AI.

Abstract

Abstract The integration of artificial intelligence (AI) into healthcare systems is increasingly hindered by the AI alignment problem. In high-stakes domains such as clinical triage, algorithms frequently reflect and amplify systemic biases. Current alignment methodologies, including Reinforcement Learning from Human Feedback (RLHF), attempt to encode subjective human morality through opaque architectural pipelines, which can exacerbate the “black box” dilemma. To address this, this paper proposes the “AI Inversion Model”, a theoretical Proof-of-Concept (PoC) framework utilizing inference-time negative constraints. Rather than attempting to compute elusive positive virtues, the model draws clinical inspiration from the psychopathic spectrum - specifically extreme utilitarianism - to define strict “anti-moral” boundaries. These boundaries are operationalized through a “Genesis” anchor system, which autonomously and recursively calibrates negative anchors by synthesizing supreme legal precedents and ethical literature. Beyond static distance, the model evaluates the mathematical kinematics of AI outputs - including vector angles and directional trajectories - to quantify algorithmic “Good Faith”. A step-by-step trace of complex surgical simulations illustrates how the filter identifies utilitarian biases and enforces a human-in-the-loop protocol when variance exceeds a heuristic 10% threshold. While not a definitive resolution to the black-box problem, this model offers a targeted, auditable baseline of “functional compassion” to mitigate systemic bias in medical AI governance.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Eyal Cohen

Ariel University

Rachel Nissanholtz-Gannot

Myers-JDC-Brookdale Institute

Yehuda Adler

Ariel University

Journals

BMC Medical Ethics

Actions

Institutions

Ariel University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

The AI inversion model: a linear negative-constraint framework for auditable alignment in medical decision-making

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study