Key points are not available for this paper at this time.
Due to corrosion characteristics, there are data scarcity and uneven distribution in corrosion datasets, and collecting high-quality data is time-consuming and sometimes difficult. Therefore, this work introduces a novel data augmentation strategy using a conditional tabular generative adversarial network (CTGAN) for enhancing corrosion datasets of pipelines. Firstly, the corrosion dataset is subjected to data cleaning and variable correlation analysis. The CTGAN is then used to generate external environmental factors as input variables for corrosion growth prediction, and a hybrid model based on machine learning is employed to generate corrosion depth as an output variable. The fake data are merged with the original data to form the synthetic dataset. Finally, the proposed data augmentation strategy is verified by analyzing the synthetic dataset using different visualization methods and evaluation indicators. The results show that the synthetic and original datasets have similar distributions, and the data augmentation strategy can learn the distribution of real corrosion data and sample fake data that are highly similar to the real data. Predictive models trained on the synthetic dataset perform better than predictive models trained using only the original dataset. In comparative tests, the proposed strategy outperformed other data generation methods.
Building similarity graph...
Analyzing shared references across papers
Loading...
Haonan Ma
Line Corporation (Japan)
Mengying Geng
University of Science and Technology Beijing
Fan Wang
Joint Institute for Computational Sciences
Materials
University of Science and Technology Beijing
Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou)
Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai)
Building similarity graph...
Analyzing shared references across papers
Loading...
Ma et al. (Fri,) studied this question.
synapsesocial.com/papers/68e761c9b6db6435876d7e7b — DOI: https://doi.org/10.3390/ma17051142