Los puntos clave no están disponibles para este artículo en este momento.
There are thousands of data repositories on the Web, providing access to millions of datasets. National and regional governments, scientific publishers and consortia, commercial data providers, and others publish data for fields ranging from social science to life science to high-energy physics to climate science and more. Access to this data is critical to facilitating reproducibility of research results, enabling scientists to build on others' work, and providing data journalists easier access to information and its provenance. In this paper, we discuss Google Dataset Search, a dataset-discovery tool that provides search capabilities over potentially all datasets published on the Web. The approach relies on an open ecosystem, where dataset owners and providers publish semantically enhanced metadata on their own sites. We then aggregate, normalize, and reconcile this metadata, providing a search engine that lets users find datasets in the “long tail” of the Web. In this paper, we discuss both social and technical challenges in building this type of tool, and the lessons that we learned from this experience.
Building similarity graph...
Analyzing shared references across papers
Loading...
Brickley et al. (Mon,) studied this question.
synapsesocial.com/papers/6a0ea336aa1655e5fb22971d — DOI: https://doi.org/10.1145/3308558.3313685
Dan Brickley
United Nations
Matthew Burgess
Community Arts Network
Natasha Noy
Google (United States)
Google (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...