Abstract Data quality is foundational to scientific research and the rapid advancement of artificial intelligence. Ensuring data quality, provenance, and reproducibility requires robust mechanisms for traceability and accountability from the moment data are created. We propose the concept of a Data Birth Certificate, a universal framework for identifying research data at creation with built-in provenance information, including time, location, and data creator. Unlike existing identifiers assigned at deposition, a Data Birth Certificate establishes immutable, origin-centered traceability that complements the established principles of Findability, Accessibility, Interoperability, and Reusability (FAIR). By capturing essential metadata at data generation, Data Birth Certificates support reliable data tracking, accountability, and downstream information management without constraining how data are stored or reused. This perspective outlines the conceptual framework, distinguishes it from existing identifier systems, and discusses its potential role in strengthening research reproducibility and data stewardship across scientific domains.
Li et al. studied this question.
synapsesocial.com/papers/69dc89183afacbeac03eaceb — DOI: https://doi.org/10.1093/nargab/lqag037
Rongbin Li
The University of Texas Health Science Center at Houston
Avisha Das
Mayo Clinic Hospital
Yuntao Yang
The University of Texas Health Science Center at Houston
NAR Genomics and Bioinformatics
University of California, San Diego
Yale University
The University of Texas Health Science Center at Houston