Aiming at the problem that the traditional SRP-PHAT sound source localization method performs intensive search in a 360-degree space, resulting in high computational complexity and difficulty in meeting real-time requirements, an innovative high-precision sound source localization method is proposed. This method combines the selective SRP-PHAT algorithm with real-time visual analysis. Its core innovations include using face detection to dynamically determine the scanning angle range to achieve visually guided selective scanning, distinguishing face sound sources from background noise through a sound source classification mechanism, and implementing intelligent background orientation selection to ensure comprehensive monitoring of environmental noise. Experimental results show that the method achieves a positioning accuracy of ±5 degrees and a processing speed of more than 10FPS in complex real environments, and its performance is significantly better than the traditional full-angle scanning method.
Jing Huang (Thu,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: