Los puntos clave no están disponibles para este artículo en este momento.
Introduction Under small-sample conditions, hyperspectral leaf chlorophyll estimation is affected by high-dimensional collinearity, measurement noise, and cross-source acquisition discrepancies. Existing studies often treat training-distribution expansion and model-error complementarity separately. This study proposed a physically constrained composite spectral augmentation–weighted ensemble framework for reproducible small-sample chlorophyll estimation. Methods Using 1,113 valid spectrum–label pairs from the leaf subset of the GreenHySpectra dataset in the 400–1000 nm range, spectra and chlorophyll reference values were matched by sample identifiers and divided into training and validation sets. Low-magnitude Gaussian noise and smooth wavelength warping were applied only to the training set. XGBoost, partial least squares regression, and ridge regression were optimized with Optuna using a CMA-ES sampler, and ensemble weights were calibrated by Bayesian optimization. An independent external set of 90 tomato leaf samples was used to evaluate transferability. Results Composite augmentation improved model stability and reduced validation error relative to the non-augmented baseline. The weighted ensemble model achieved the best internal performance, with R² = 0.6392 and RMSE = 8.8883. On the external samples, the model achieved R² = 0.498 and RMSE = 9.801. Discussion The proposed workflow integrates physically plausible augmentation, heterogeneous learner complementarity, and independent external validation. The external results indicate partial cross-source transferability while highlighting distributional and measurement-chain discrepancies that still limit absolute generalization.
Xu et al. (Wed,) studied this question.