Key points are not available for this paper at this time.
The number of publications in biomedicine and life sciences has grown so much that it is difficult to keep track of new scientific works and to have an overview of the evolution of the field as a whole. Here, we present a two-dimensional (2D) map of the entire corpus of biomedical literature, based on the abstract texts of 21 million English articles from the PubMed database. To embed the abstracts into 2D, we used the large language model PubMedBERT, combined with t-SNE tailored to handle samples of this size. We used our map to study the emergence of the COVID-19 literature, the evolution of the neuroscience discipline, the uptake of machine learning, the distribution of gender imbalance in academic authorship, and the distribution of retracted paper mill articles. Furthermore, we present an interactive website that allows easy exploration and will enable further insights and facilitate future research.
Building similarity graph...
Analyzing shared references across papers
Loading...
Rita González-Márquez
Luca Schmidt
Benjamin M. Schmidt
Patterns
Heidelberg University
University of Tübingen
Hertie Institute for Clinical Brain Research
Building similarity graph...
Analyzing shared references across papers
Loading...
González-Márquez et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68e6fb9db6db643587676361 — DOI: https://doi.org/10.1016/j.patter.2024.100968
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: