Open, non-governmental building datasets have become increasingly important for urban analysis, exposure modelling, and policy support. Despite their growing use, little is known about the consistency, completeness, and comparability of the semantic information they provide at a continental scale. This study presents the first systematic comparison of the semantic attributes of six major pan-European open building datasets—OpenStreetMap, EUBUCCO, Microsoft Global ML Building Footprints, Overture Maps, GHS-OBAT, and the Digital Building Stock Model (DBSM)—using the 27 EU Member States as a common reference area. Five key semantic attributes (height, typology, building age, number of floors, and building material) were harmonised and analysed in terms of completeness and value distributions across countries and degrees of urbanisation. The workflow combines API-based data ingestion, distributed geospatial processing, and high-performance computing to handle around 1.250 billion building footprints. Results reveal pronounced heterogeneity in semantic content across datasets. Remote-sensing-derived products (GHS-OBAT and DBSM) exhibit the highest levels of attribute completeness for height, typology, and building age, but rely on aggregated or coarse semantic representations. In contrast, community-driven and conflated datasets (OpenStreetMap and Overture Maps) provide richer and more detailed semantic schemas, albeit with low and spatially uneven completeness. Completeness patterns vary substantially across countries and urbanisation classes, and high completeness values often mask limited semantic informativeness due to the prevalence of unknown or aggregated attribute values. Overall, the findings demonstrate that no single dataset is universally optimal regarding consistency and completeness of building footprints’ semantic attributes. Nonetheless, the paper provides practical guidance for selecting suitable data sources depending on spatial scale, attribute requirements, and analytical objectives.
Gabrielli et al. (Thu,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: