Accurate monthly predictions of global horizontal irradiance (GHI) are essential for optimizing solar energy integration and ensuring stable power generation. This study proposes a novel hybrid framework for monthly GHI prediction over China that combines a numerical climate prediction model (JMA/MRI-CPS3) with Linear Regression and four machine learning techniques, including Decision Tree, Random Forest, Gradient Boosting Regression Trees, and Naive Bayes. The framework extracts statistical relationships between atmospheric circulation, external forcing fields, and GHI variability using ERA5 reanalysis and JMA/MRI-CPS3 hindcast datasets from February 1991 to January 2020. Empirical Orthogonal Function decomposition and correlation analysis are applied to identify the dominant spatial–temporal modes of GHI variability and their associated predictors. The predictive performance of each individual model, as well as their Ensemble Mean, is systematically evaluated. Among the individual models, the Decision Tree exhibits the best overall performance, with RMSE = 49.78 MJ/m², MAE = 38.67 MJ/m², and SCC = 0.86. The Ensemble Mean yields slightly lower skill (RMSE = 50.45 MJ/m², MAE = 39.47 MJ/m², SCC = 0.85) but outperforms the remaining individual models and demonstrates greater robustness, producing predictions highly consistent with those of the Decision Tree. Model validation against independent observations from February 2022 to January 2024 shows good overall agreement across most regions of China, with relative errors generally within 10%. Further station-based analysis reveals clear regional and seasonal contrasts in prediction skills. Predictions at the two northern stations (Ruoqiang and Harbin) show higher accuracy, with correlation coefficients of 0.97 and 0.95, compared to the two southern stations (Fuzhou and Kunming), where correlations decrease to 0.75 and 0.70. In addition, winter predictions exhibit the highest accuracy, while summer predictions show much larger errors. Overall, this hybrid framework appears to effectively capture the nonlinear interactions among key meteorological drivers of GHI, showing promising potential for reliable monthly solar radiation prediction.
Li et al. (Sun,) studied this question.