Soil contamination in South African agriculture poses escalating threats to food security and ecosystem integrity, yet the geospatial and machine learning evidence base addressing this problem has never been systematically synthesised. This scoping review, conducted within the PRISMA-ScR framework, applied SVM-assisted screening to 2000 retrieved records, yielding a final corpus of 228 eligible studies published from 2003 to 2025. To characterise temporal, thematic, and geographic patterns in the corpus, we applied machine learning-assisted topic modelling (LDA, k = 7), logistic growth modelling, keyword co-occurrence network analysis, and technology–contaminant evidence gap matrices. Remote sensing was the dominant methodology throughout the review period (n = 142; 62.3% of studies), with machine learning rising to the highest adoption rank from approximately 2020 onwards. Logistic modelling estimated a carrying capacity of K = 292.3 (95% CI: 269–324) studies and an inflexion year of 2020.2 (95% CI: 2019.4–2021.1), projecting 90% saturation by 2028. Research effort was highly concentrated in KwaZulu-Natal and the Eastern Cape, while Pesticides/Herbicides and acid mine drainage each comprised only three corpus studies. Deep learning registered zero entries across all cells of both the technology–contaminant and technology–province evidence matrices. Targeted investment in field validation, hyperspectral and deep learning deployment for underrepresented contaminants, and interpretable modelling for regulatory defensibility are identified as priority actions for the next research cycle.
Nxumalo et al. (Fri,) studied this question.