We present evidence that a 14-billion parameter language model running on consumer hardware can reliably assess the quality of its own outputs against a structured rubric, without any external model, cloud service, or human scorer in the evaluation loop. The key methodological contribution is a calibration forcing function: a prompt-level technique that requires the model to identify a specific rubric gap before assigning the maximum score. Without this forcing function, the model exhibits uniform score inflation (all outputs scored 10/10 across four probes in the initial run). With the forcing function applied, scores distribute across four distinct values (6, 7, 8, 10) with rubric-grounded gap descriptions that are defensible by human evaluation on all nine valid assessments. Across four experimental runs on the same canonical verification battery, we observed three findings with implications for sovereign AI deployment. First, the self-assessor demonstrates superior contextual understanding over mechanical keyword scoring: it correctly identifies that a negated forbidden term does not constitute a violation, while keyword matching produces a false positive. Second, the self-assessor correctly penalises responses that contain required keywords but lack the reasoning depth the rubric demands. Third, the directional pattern of disagreement between self-assessment and keyword scoring is consistent and interpretable: self-assessment scores higher when keywords are wrong, and lower when responses lack reasoning substance that keywords cannot detect. These findings suggest that the dependency on frontier models or human scorers for quality assurance in locally deployed AI systems is not absolute.
Building similarity graph...
Analyzing shared references across papers
Loading...
Farah Jaber
Hôpital du Sacré-Cœur de Montréal
Building similarity graph...
Analyzing shared references across papers
Loading...
Farah Jaber (Mon,) studied this question.
synapsesocial.com/papers/69d5f09e74eaea4b11a7a13f — DOI: https://doi.org/10.5281/zenodo.19435861