Flash floods arise from the interaction of rugged topography, short-duration intense rainfall, and rapid flow concentration. Conventional risk mapping often builds empirical indices with expert-assigned weights or trains supervised models on historical event inventories—approaches that degrade in data-scarce regions. We propose a fully data-driven, unsupervised Geographic Information System (GIS) framework based on fractional order k-means, which clusters multi-dimensional geospatial features without labeled flood records. Five raster layers—elevation, slope, aspect, 24 h maximum rainfall, and distance to the nearest stream—are normalized into a feature vector for each 30 m × 30 m grid cell. In a province-scale case study of Zhejiang, China, the resulting risk map aligns strongly with the observations: 95% of 1643 documented flash flood sites over the past 60 years fall within the combined high- and medium-risk zones, and 65% lie inside the high-risk class. These outcomes indicate that the fractional order distance metric captures physically realistic hazard gradients while remaining label-free. Because the workflow uses commonly available GIS inputs and open-source tooling, it is computationally efficient, reproducible, and readily transferable to other mountainous, data-poor settings. Beyond reducing subjective weighting inherent in index methods and the data demands of supervised learning, the framework offers a pragmatic baseline for regional planning and early-stage screening.
Li et al. (Thu,) studied this question.