Populating an open repository with high-quality, consistent metadata is a substantial task. The challenge becomes even harder when records also need to be enriched with the corresponding full text at scale, particularly when authors are not involved in deposit workflows. In 2025, the University of Galway and Atmire developed and deployed a DOI-driven workflow to enrich metadata-only repository records with open full text links. The tool queries multiple open services using the DOI, selects the most credible full text candidate, and records both provenance and outcomes to support review and reporting. In production, this approach identified and attached thousands of full text PDFs with minimal manual intervention, while surfacing cases that require follow-up due to redirects, inconsistent landing pages, or unclear licensing signals. The implementation is designed to be extensible, with additional sources and local policy rules added as needed. The session will demonstrate the Google Apps Script and Google Sheets version, describe key design trade-offs (accuracy, coverage, validation, and rate limiting), and share an approach that other repository teams can adapt to their own infrastructure. Currently supported sources include OpenAIRE, Unpaywall, CORE, and OpenAlex.
Joy et al. (Wed,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: