MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection