What question did this study set out to answer?

This research aims to enhance pavement crack detection through an advanced transformer-based architecture that addresses the limitations of CNNs.

March 26, 2026Open Access

AutoCrack: Deep Transformer With Pyramid Pooling for Autonomous Pavement Crack Identification

Puntos clave

This research aims to enhance pavement crack detection through an advanced transformer-based architecture that addresses the limitations of CNNs.
Developed a pyramid pooling–powered transformer backbone network for crack detection.
Utilized various pooling techniques to create feature maps with different strides and receptive fields.
Concatenated output data from multiple pooling layers to form a final pooled feature map.
The proposed model outperformed state-of-the-art techniques in crack detection.
Improvements were noted in precision, recall, and F-measure compared to existing models.

Resumen

A frequent problem with engineering constructions such as roads and buildings is surface cracks. The development of deep learning algorithms has significantly enhanced the ability to automatically detect surface cracks. Although recently developed transformer topologies may offer advantages, convolutional neural networks (CNNs) remain the most common method for this type of research. CNNs used as feature extractors can thoroughly explore the local connections of image blocks, which aid in enhancing detection performance, but fail to capture global dependencies within image blocks. The transformer’s ability to thoroughly examine global dependencies on sequential data has recently drawn attention. However, the transformer has a significant computational cost due to its attention mechanism. In this research, we construct a pyramid pooling–powered transformer backbone network for crack detection, utilizing several pooling techniques to generate feature maps with varying strides and receptive fields. The final pooled feature map is created by concatenating the output data from each pooling layer. The proposed architecture thus captures the features more robustly compared to the existing techniques, which enhances the crack detection accuracy. Systematic experiments demonstrate that the proposed model outperforms the chosen state‐of‐the‐art baselines in the pavement crack detection task in terms of precision, recall, and F‐measure.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo