Accurate classification of uranium slag origin and category is essential for nuclear environmental monitoring and safety. This study presents a hybrid framework combining laser-induced breakdown spectroscopy (LIBS), four preprocessing methods, and five machine learning algorithms for rapid uranium slag classification. A total of nine sample categories were collected from three mining areas, with categories defined by their U concentration levels within each origin. Standard normal variate (SNV), Savitzky–Golay smoothing (SG), and their combinations (SNV-SG, SG-SNV) were applied to evaluate preprocessing effects. To address ultra-high-dimensional spectral data (49,242 points per spectrum), principal component analysis (PCA) and random forest (RF) were employed for feature engineering, integrated with support vector machine (SVM), linear discriminant analysis (LDA), and K-nearest neighbors (KNN) classifiers. Hyperparameter optimization via five-fold cross-validation and Bayesian optimization enhanced accuracy and efficiency. RF-based hybrid models consistently outperformed PCA-based counterparts. Remarkably, the RF-LDA model with SNV-SG preprocessing achieved 100% classification accuracy across all test sets with a processing time of only 10.46 s, demonstrating exceptional discriminative power and computational efficiency. These findings establish that combining RF feature selection with advanced machine learning offers a robust solution for LIBS-based nuclear material classification, with significant implications for both nuclear safety and resource management.
Zhang et al. (Sun,) studied this question.