What question did this study set out to answer?

To develop a robust full-waveform inversion technique that addresses limitations of existing methods using a hybrid CNN and vision transformer architecture.

April 29, 2026 · Open Access

Robust Physics-Informed Reparameterized Full-Waveform Inversion via CNN-Enhanced Vision Transformer

Key Points

Introduced a physics-informed reparameterized FWI framework that combines a CNN with a vision transformer using spatial-reduction attention.
Tested the method on synthetic models, reconstructing velocity structures from low-quality initial models.
Applied the method to field data to assess seismic imaging quality and model reliability.
The method reliably reconstructs velocity structures from low-quality initial models, outperforming conventional FWI and CNN-based FWI on the reported metrics.
It is more robust than these baselines under noise contamination and when low-frequency data are missing.
On field data, the recovered velocity models improve seismic imaging quality and provide a more reliable basis for subsequent workflows.

Abstract

Full-waveform inversion (FWI) provides high-resolution subsurface characterization but remains vulnerable to ill-posedness, cycle skipping, and local minima when the starting model is inaccurate or low-frequency information is missing. We introduce a physics-informed reparameterized FWI framework that improves robustness under challenging acquisition conditions. The framework uses a hybrid architecture that combines a convolutional neural network (CNN) with a vision transformer (ViT) enhanced with spatial-reduction attention (SRA), which reduces computational cost while preserving global dependencies. In the proposed scheme, the CNN extracts multi-shot local seismic attributes, whereas the ViT models long-range correlations and enforces structural coherence. The untrained nature of the hybrid network acts as an implicit regularizer, enabling smooth and geologically plausible model updates while reducing the non-uniqueness of the inversion. Numerical tests on representative synthetic models demonstrate that the method reliably reconstructs velocity structures from low-quality initial models and outperforms conventional FWI and CNN-based FWI approaches, particularly under noise contamination and with low-frequency-deficient data. The field data example further demonstrates that the recovered velocity models improve seismic imaging quality and provide a more reliable foundation for subsequent imaging workflows.
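To illustrate how spatial-reduction attention can lower cost while every query still attends across the whole grid, the following is a minimal NumPy sketch, not the authors' implementation. It assumes single-head attention, random projection matrices, and average pooling over non-overlapping patches as a stand-in for the learned strided reduction typically used in SRA. The function name `spatial_reduction_attention` and all parameter names are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def spatial_reduction_attention(tokens, height, width, reduction, w_q, w_k, w_v):
    """Single-head attention with spatially reduced keys and values.

    tokens    : (height * width, dim) array, row-major over the grid
    reduction : patch size R; must divide both height and width
    w_q, w_k, w_v : (dim, dim) projection matrices (random here, an assumption)
    """
    n, dim = tokens.shape
    assert n == height * width
    assert height % reduction == 0 and width % reduction == 0

    # Queries keep the full spatial resolution.
    q = tokens @ w_q

    # Assumption: average each R x R patch to shrink the key/value set.
    grid = tokens.reshape(height // reduction, reduction,
                          width // reduction, reduction, dim)
    reduced = grid.mean(axis=(1, 3)).reshape(-1, dim)

    k = reduced @ w_k
    v = reduced @ w_v

    # Each query attends to every reduced token, so coverage stays global.
    scores = q @ k.T / np.sqrt(dim)
    weights = softmax(scores, axis=-1)
    return weights @ v, weights


rng = np.random.default_rng(0)
height, width, dim, reduction = 8, 8, 16, 4
tokens = rng.standard_normal((height * width, dim))
w_q, w_k, w_v = (rng.standard_normal((dim, dim)) / np.sqrt(dim) for _ in range(3))

out, weights = spatial_reduction_attention(
    tokens, height, width, reduction, w_q, w_k, w_v
)
print(out.shape, weights.shape)
```

With an 8 × 8 grid and a reduction factor of 4, the attention matrix has 64 × 4 entries instead of the 64 × 64 of full self-attention, a saving of R² = 16, while each output token still depends on all regions of the grid.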
