May 7, 2026 | Open Access

Automated detection of freeze-thaw signatures in archaeological sediments using deep learning

Key Points

The aim is to automate the detection of freeze-thaw signatures in archaeological sediments using deep learning techniques.
Five convolutional neural network architectures were trained on photomicrographs from eleven Plio-Pleistocene archaeological sites.
Classification followed a two-stage approach: first the presence of freeze-thaw signatures, then the feature type.
Model outputs were validated against interpretability analysis and a blind survey of practicing micromorphologists.
Models achieved high accuracy in detection but relied primarily on spurious correlations rather than diagnostic criteria.
Expert agreement on feature detection was low, indicating high variability and uncertainty in results.
Errors from models and experts were largely independent, highlighting the complementarity of approaches for effective frost feature recognition.
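
The last point, that model and expert errors are largely independent, can be checked in several ways, and this summary does not say which statistic the authors used. One common option is the phi coefficient between two binary error indicators, sketched here in Python. The function name `phi_coefficient` and the toy data are illustrative assumptions, not taken from the study.

```python
from math import sqrt
from typing import Optional, Sequence


def phi_coefficient(model_errors: Sequence[int],
                    expert_errors: Sequence[int]) -> Optional[float]:
    """Phi coefficient between two binary error indicators (1 = error on that image).

    Returns None when either indicator is constant, because the
    coefficient is undefined in that case.
    """
    if len(model_errors) != len(expert_errors):
        raise ValueError("both sequences must cover the same images")

    # Build the 2x2 contingency table of error co-occurrence.
    pairs = list(zip(model_errors, expert_errors))
    n11 = sum(1 for m, e in pairs if m and e)          # both wrong
    n10 = sum(1 for m, e in pairs if m and not e)      # only the model wrong
    n01 = sum(1 for m, e in pairs if not m and e)      # only the expert wrong
    n00 = sum(1 for m, e in pairs if not m and not e)  # both right

    denom = sqrt((n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00))
    if denom == 0:
        return None
    return (n11 * n00 - n10 * n01) / denom


# Hypothetical toy data: four images, errors that do not co-occur systematically.
model_err = [1, 1, 0, 0]
expert_err = [1, 0, 1, 0]
print(phi_coefficient(model_err, expert_err))
```

A value near 0 is consistent with errors that occur independently, which is the situation in which pairing model screening with expert review is most likely to help. A value near 1 would mean the model and the experts fail on the same images.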

Abstract

Background: Freeze-thaw processes leave diagnostic traces in archaeological soils and sediments that are central to reconstructing past climates and understanding hominin adaptations to glacial environments. Identifying these features through thin section micromorphology is well established, but the traces can be subtle, variably expressed, and overlap with other pedogenic processes, making their identification time-consuming, expert-dependent, and subject to high inter-observer variability.

Methods: We trained five convolutional neural network architectures on photomicrographs from eleven Plio-Pleistocene archaeological sites, implementing a two-stage classification approach (first presence, then feature type), and validated model outputs against both interpretability analysis and a blind survey of practicing micromorphologists.

Results: The results reveal a performance paradox. Models that achieve high performance rely on spurious correlations rather than diagnostic criteria, while models that focus on micromorphologically relevant features show lower overall performance. Expert agreement on the same task is low, with uncertainty concentrated in feature detection rather than classification. Crucially, model and expert errors are largely independent, and each captures different aspects of frost feature recognition, establishing a basis for complementarity.

Conclusions: These findings demonstrate that effective computational integration in micromorphology requires not only accurate classification but also interpretability validation, ensuring that models reason from the same diagnostic criteria as experts. We propose a human-in-the-loop approach in which models provide consistent first screening, while experts offer contextual interpretation and diagnostic validation. Additionally, we present an interactive open-access tool that implements this pipeline to facilitate adoption and repeatability.
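
The two-stage approach named in the abstract (first presence, then feature type) can be illustrated with a minimal Python sketch of the decision logic alone. It assumes each stage has already produced probabilities. The function name `two_stage_predict`, the 0.5 threshold, and the class labels `type_a` and `type_b` are hypothetical placeholders, not the study's architectures, settings, or feature classes.

```python
from typing import Mapping, Optional, Tuple


def two_stage_predict(p_present: float,
                      type_probs: Mapping[str, float],
                      threshold: float = 0.5) -> Tuple[bool, Optional[str]]:
    """Combine a presence score and per-type scores into one decision.

    Stage 1: decide whether any freeze-thaw signature is present.
    Stage 2: only if present, pick the most probable feature type.
    """
    if not 0.0 <= p_present <= 1.0:
        raise ValueError("p_present must be a probability in [0, 1]")

    # Stage 1: presence gate (the threshold is an assumed default).
    if p_present < threshold:
        return (False, None)

    # Stage 2: feature type, evaluated only for images that pass stage 1.
    if not type_probs:
        raise ValueError("type_probs must contain at least one class")
    best_type = max(type_probs, key=lambda name: type_probs[name])
    return (True, best_type)


# Hypothetical scores for two images.
print(two_stage_predict(0.2, {"type_a": 0.6, "type_b": 0.4}))
print(two_stage_predict(0.9, {"type_a": 0.3, "type_b": 0.7}))
```

Separating the two decisions lets errors be attributed to a stage, which matters here because the abstract reports that expert uncertainty is concentrated in detection rather than in classification.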
