What question did this study set out to answer?

The aim is to improve oriented object detection in RGB-infrared imagery by integrating semantic information during fusion.

March 2, 2026 · Open Access

SGFNet: Semantic-Guided Fusion Network with Closed-Loop Feedback for RGB-Infrared Oriented Object Detection

Key Points

Developed SGFNet, which comprises three modules: a Frequency-aware Disentanglement Module (FDM), a Semantic-Guided Module (SGM), and an Adaptive Geometric Convolution (AGC).
Evaluated on the DroneVehicle benchmark, which contains 28,439 RGB-IR image pairs across five vehicle categories.
Fed detection-level semantics back into the fusion stage through learned importance masks.
Achieved 82.0% mAP@0.5 (mean average precision at an IoU threshold of 0.5), surpassing the runner-up, DMM, by 3.2 percentage points.
Reduced mean angular error from 7.4° to 6.2°, a relative reduction of about 16%.
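
The headline numbers can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function name `relative_reduction` is made up for this example, and the runner-up mAP is not quoted on this page but derived here from the reported 3.2 percentage-point margin.

```python
def relative_reduction(before: float, after: float) -> float:
    """Return the fractional decrease going from `before` to `after`."""
    return (before - after) / before


# Mean angular error reported on this page: 7.4 degrees down to 6.2 degrees.
angle_gain = relative_reduction(7.4, 6.2)

# Runner-up mAP@0.5 implied by the reported margin (derived, not quoted).
runner_up_map = 82.0 - 3.2

print(f"relative angular-error reduction: {angle_gain:.1%}")
print(f"implied runner-up mAP@0.5: {runner_up_map:.1f}%")
```

Note that the 3.2-point gain is an absolute difference in mAP, whereas the 16% figure is relative to the baseline angular error, so the two numbers are not directly comparable.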

Abstract

In oriented object detection from drone imagery, many existing RGB-infrared (RGB-IR) fusion methods derive modality weights from input statistics alone, without regard for downstream detection objectives. We present SGFNet, a Semantic-Guided Fusion Network that feeds detection-level semantics back into the fusion stage through learned importance masks. SGFNet comprises three modules: (1) a Frequency-aware Disentanglement Module (FDM) that separates high-frequency textures from low-frequency thermal structures through Laplacian and Gaussian filtering; (2) a Semantic-Guided Module (SGM) that generates P5-level semantic masks to steer fusion toward detection-critical regions; and (3) an Adaptive Geometric Convolution (AGC) whose rotation-aware sampling matches receptive fields to arbitrarily oriented objects. On the DroneVehicle benchmark (28,439 RGB-IR pairs, five vehicle categories), SGFNet achieves 82.0% mAP@0.5, surpassing the runner-up DMM by 3.2 percentage points while lowering mean angular error from 7.4° to 6.2° (−16%). Ablation analysis attributes the largest single-module gain (+1.7 pp) to the semantic feedback path.
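
To make the idea of separating frequency bands concrete, here is a minimal sketch of a Gaussian low-pass and residual high-pass split. It is not the paper's FDM: the abstract only states that Laplacian and Gaussian filtering are used, so the residual high-pass (image minus blur, in the style of a Laplacian pyramid level), the kernel parameters, and every function name below are assumptions chosen for illustration.

```python
import math


def gaussian_kernel(sigma: float, radius: int) -> list[float]:
    """Normalized 1D Gaussian weights over [-radius, radius]."""
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def blur_1d(row: list[float], kernel: list[float]) -> list[float]:
    """Convolve one row with the kernel, replicating edge pixels."""
    radius = len(kernel) // 2
    n = len(row)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), n - 1)
            acc += w * row[j]
        out.append(acc)
    return out


def gaussian_blur(image, sigma=1.0, radius=2):
    """Separable 2D blur: filter rows first, then columns."""
    kernel = gaussian_kernel(sigma, radius)
    rows = [blur_1d(r, kernel) for r in image]
    cols = [blur_1d(list(c), kernel) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]


def split_frequencies(image, sigma=1.0, radius=2):
    """Return (low, high), where low is the blur and high = image - low."""
    low = gaussian_blur(image, sigma, radius)
    high = [[p - l for p, l in zip(p_row, l_row)]
            for p_row, l_row in zip(image, low)]
    return low, high


# A flat patch has no detail, so its high-frequency band is near zero.
flat = [[5.0] * 6 for _ in range(6)]
low_flat, high_flat = split_frequencies(flat)

# A vertical step edge puts its energy in the high band around the step.
edge = [[0.0, 0.0, 0.0, 1.0, 1.0, 1.0] for _ in range(6)]
low_edge, high_edge = split_frequencies(edge)
```

By construction the two bands sum back to the input, so the split loses no information; how the paper then weights or fuses each band is not described on this page.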

Cite This Study

Zhang et al. studied this question.

synapsesocial.com/papers/69a52dbff1e85e5c73bf0de8 https://doi.org/10.3390/electronics15051003