What question did this study set out to answer?

The study aims to assess machine learning models' accuracy in predicting soil salinity using satellite imagery data.

April 10, 2026Open Access

Mapping of salt-affected soil using machine learning and remote sensing in Raya Kobo Valley, Ethiopia

Key Points

The study aims to assess machine learning models' accuracy in predicting soil salinity using satellite imagery data.
Evaluated four machine learning regression models: Random Forest, Gradient Boosting Trees, Decision Tree, and Support Vector Machine.
Analyzed spectral indices derived from Landsat 8 OLI imagery.
Collected 33 soil samples from a total area of 939.46 hectares for validation.
Utilized Google Earth Engine and R software for analysis.
Gradient Boosting Trees and Random Forest achieved high R2 values of 0.93 and 0.902, respectively.
Classified the study area into slightly saline (31.2%), moderately saline (49.9%), and strongly saline (18.9%).
Both models demonstrated lower errors and effective spatial mapping capabilities.

Abstract

In the Abuarie-Addisalem irrigation scheme within the Raya Valley, many productive irrigated lands have become unproductive due to the effects of salinity and sodicity. This study aimed to evaluate the predictive performances of four machine learning regression models: Random Forest (RF), Gradient Boosting Trees (GBT), Decision Tree (DT), and Support Vector Machine (SVM). The specific objectives were to (1) identify the most effective spectral indices derived from Landsat 8 OLI imagery for soil salinity mapping, and (2) determine which model provides the most accurate predictions when integrated with these indices. A total of 33 surface soil samples (0–30 cm) were collected to represent 939.46 hectares of land. The analysis was performed trough Google Earth Engine and R software statistical and graphical techniques. Results indicated that GBT and RF models were the most efficient, particularly for the Modified Soil Adjusted Vegetation Index (MSR), achieving a high coefficient of determination (R2) value of 0.93 with GBT and 0.902 with RF. RF and GBT produced similar results, classified the study area into slightly saline (31.2%), moderately saline (49.9%), and strongly saline (18.9%). Both models demonstrated lower Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and efficient for spatial mapping, while DT and SVM exhibited limited performance. Therefore, RF and GBT models are recommended for practical application.

Bookmark

View Full Paper

Bookmark

View Full Paper

Mapping of salt-affected soil using machine learning and remote sensing in Raya Kobo Valley, Ethiopia

Key Points

Abstract

Cite This Study