Abstract. Wind turbine operation is commonly described using supervisory control and data acquisition (SCADA) systems. While high-frequency SCADA data (e.g., 1 s resolution) exist, the vast majority of fleet-wide records available for analysis consist of 10 min averages. These coarse aggregates obscure short transients and dynamic interactions, access is often restricted by proprietary control systems, and the data frequently contain gaps. To address these limitations, a SCADA-free approach is developed in which operational states are inferred directly from high-frequency nacelle acceleration, a sensor that is increasingly being installed across wind farms, e.g., to monitor loads. The proposed method is based on a denoising auto-encoder, to which a domain-adversarial neural network (DANN) mechanism and a deep embedded clustering (DEC) self-supervision are added. Compact six-dimensional representations of 1 min vibration spectra between 0 and 3 Hz are learned. Turbine-specific signatures are suppressed through a domain-adversarial regularization, leading to turbine-invariant embeddings that capture a generalized representation of turbine dynamics. A self-supervised DEC objective structures the latent space into discrete and physically meaningful operational regimes, thereby facilitating post hoc analysis of the learned embeddings. Training is performed on data from 11 out of 44 turbines on an offshore wind farm sampled at 31.25 Hz, while SCADA signals are used only for validation. Strong correspondence is observed between the learned embeddings and pitch, rotor speed, power, and wind speed, with normalized mutual information above 0.8. Turbine invariance is verified through mutual-information analysis between embeddings and turbine identity. This analysis also reveals clusters within the wind farm and indicates whether the learned representation can be consistently applied across different turbines. As an auxiliary validation, regression models were trained on the learned embeddings to predict 10 min damage-equivalent moments (DEMs). The regressors were fitted using data from only five strain-instrumented turbines and then were applied fleet-wide. Accurate fatigue predictions were obtained across all turbines, with R2=0.96, surpassing SCADA-based baselines. This demonstrates that the learned embeddings generalize beyond operational description and contain sufficient load-related information to support fleet-wide fatigue estimation, enabling high-resolution monitoring without dependence on SCADA.
Bel-Hadj et al. (Thu,) studied this question.