What question did this study set out to answer?

The aim is to propose a standard that ensures scientific claims are machine-readable, addressing existing verification challenges.

March 26, 2026Open Access

Paper as Specification: A Machine-Readable Standard for Scientific Claims

Key Points

The aim is to propose a standard that ensures scientific claims are machine-readable, addressing existing verification challenges.
Introduced Paper Spec v0.1.0, a YAML-based file for scientific papers.
Developed five verification layers (L0-L4) for analyzing scientific claims.
Analyzed a corpus of twenty papers to demonstrate the standard's capabilities.
Revealed dependency structures and self-citation patterns in papers.
Identified retraction propagation pathways that were not visible in unstructured formats.
Confirmed that paper.yaml improves verification and collaboration capabilities.

Abstract

The cost of generating scientific hypotheses has collapsed. Large language models can produce a thousand plausible research questions before dinner, yet the infrastructure for verifying, replicating, and building upon scientific claims remains essentially unchanged since the seventeenth century: unstructured prose, evaluated by overworked human reviewers, with no machine-readable representation of what is being claimed, how it was tested, or under what conditions it would be falsified. We argue that the verification crisis in science is, at its root, a specification crisis – the same class of problem that printing press pamphlet wars and early internet information overload represented in their respective eras. Each generation breakthrough in information production was resolved not by faster consumption but by a specification breakthrough: the scientific method formalized falsifiable prediction; structured data standards (HTML metadata, schema.org, knowledge graphs) made web content machine-parseable. The current generation breakthrough – LLM-driven hypothesis production – demands an equivalent specification response for scientific claims themselves. We introduce Paper Spec v0.1.0, a YAML-based standard that provides a machine-readable companion file (paper.yaml) for scientific papers. Paper Spec captures claims, methodology, acceptance criteria, results, dependencies, contradictions, and limitations in a format that is human-writable in thirty minutes, machine-parseable by any YAML processor, and incrementally adoptable (every section is optional except bibliographic metadata). We define five verification layers (L0-L4) that the standard enables, ranging from schema validation to cross-corpus dependency graph traversal and retraction cascade analysis. We demonstrate the standard on a corpus of twenty papers, revealing dependency structures, self-citation patterns, and retraction propagation pathways that are invisible in the unstructured originals. We advance four propositions concerning the expressiveness, verification utility, dependency analysis capability, and cognitive value of the standard, and discuss implications for research assessment, peer review, and the use of paper.yaml as an independent pre-registration and collaboration artifact.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Dmitry Zharnikov (Tue,) studied this question.

synapsesocial.com/papers/69c4cd65fdc3bde448919abd https://doi.org/https://doi.org/10.5281/zenodo.19210038

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Bookmark

View Full Paper