What does this research mean for the field?

The proposed dynamic prototype fusion network (PFN) significantly improves test-time few-shot object detection performance compared to existing state-of-the-art methods. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The research aims to improve few-shot object detection performance by addressing the challenges of limited data and category shifts during test time.

February 28, 2026

Test-Time Few-Shot Object Detection via Dynamic Prototype Fusion

Puntos clave

The research aims to improve few-shot object detection performance by addressing the challenges of limited data and category shifts during test time.
Developed a dynamic prototype fusion network to refine object prototypes adaptively.
Implemented a dual-level multiscale integration approach for better information fusion.
Introduced a mask-based preprocessing technique using segmentation labels to minimize background noise.
Kept model parameters fixed during testing while updating only prototypes with new support samples.
Achieved superior performance compared to existing state-of-the-art FSOD methods.
Demonstrated effective reduction in the negative impact of distribution shifts.
Showed enhanced discriminating capabilities by utilizing multiscale information integration.

Resumen

Test-time few-shot object detection (FSOD) represents an innovative approach for identifying novel categories using a limited number of support examples, obviating the need for model fine-tuning. Despite advancements, existing FSOD methods, including our prior work, continue to grapple with challenges posed by domain/category shift and limited data availability. Building upon our previous research on test-time FSOD, this article proposes a novel dynamic prototype fusion network (PFN) to overcome these limitations. To mitigate the impact of the distribution shift, a dynamic prototype refinement method is introduced that updates prototypes from supporting images in an adaptive manner. Further, limited samples are mitigated through exhaustive exploitation of information within support images. Specifically, we design a dual-level multiscale information integration approach that effectively fuses information across different network layers and image scales, enhancing the model's discriminating capabilities. Additionally, a mask-based preprocessing technique harnesses segmentation labels on support samples, effectively suppressing the adverse impact of background noise on model accuracy. Notably, to align with the constraints of test-time scenarios, model parameters remain fixed during the configuration step, with only prototypes being updated each time users input novel supporting samples. As a result, our method achieves superior performance over existing state-of-the-art FSOD methods on multiple benchmarks, demonstrating remarkable potential in the realm of FSOD. The code is available at https://github.com/CatfishW/TIDEV2.

Me gusta

Guardar

Me gusta

Guardar

Test-Time Few-Shot Object Detection via Dynamic Prototype Fusion

Puntos clave

Resumen

Cite This Study