Multi Small-Object Tracking (MSOT) is crucial for drone inspection and intelligent monitoring, yet traditional Multiple-object Tracking (MOT) methods perform poorly in such scenarios. The reasons include the following: small targets have low resolution and sparse features, leading to high missed detection rates; frequent occlusion and motion blur in dense scenes cause trajectory interruption and identity switches. To address these issues, an MSOT method combining dual motion modeling and dynamic Region of Interest (ROI) detection is proposed. The dual motion framework integrates Kalman filtering and optical flow through dynamic weighting to optimize target state estimation. The Kalman filter-guided dynamic ROI mechanism, combined with multi-feature fusion, enables trajectory recovery when targets are lost. Experiments on the VisDrone-MOT and UAVDT datasets show that this method outperforms mainstream algorithms in core metrics such as MOTA and HOTA, with better trajectory continuity and identity consistency while maintaining good real-time performance.
Ma et al. (Sun,) studied this question.