What type of study is this?

September 10, 2025Open Access

Enhancing YOLOv11 with Large Kernel Attention and Multi-Scale Fusion for Accurate Small and Multi-Lesion Bone Tumor Detection in Radiographs

Key Points

YOLOv11-MTB achieves a mean average precision (mAP) of 79.6% on the BTXRD dataset, indicating superior performance.
For small lesions, the model reaches an mAP of 55.8%, while multi-lesion detection achieves 63.2%, showing significant effectiveness.
The framework integrates multi-scale transformer-based attention and boundary-aware feature fusion, enhancing detection capacity.
Promising generalization and accuracy of YOLOv11-MTB suggests viable applications in clinical bone tumor diagnosis.

Abstract

Objectives: Primary bone tumors such as osteosarcoma and chondrosarcoma are rare but aggressive malignancies that require early and accurate diagnosis. Although X-ray radiography is a widely accessible imaging modality, detecting small or multi lesions remains challenging. Existing deep learning models are often trained on small, single-center datasets and lack generalizability, limiting their clinical effectiveness. Methods: We propose the YOLOv11-MTB, a novel enhancement to YOLOv11 integrating multi-scale Transformer-based attention, boundary-aware feature fusion, and receptive field augmentation to improve detection of small and multi-focal lesions. The model is trained and evaluated on two multi-center datasets, including the BTXRD dataset containing annotated radiographs with lesion types and bounding boxes. Results: YOLOv11-MTB achieves state-of-the-art performance on bone tumor detection tasks. It attains a mean average precision (mAP) of 79.6% on the BTXRD dataset, outperforming existing methods. In clinically relevant categories, the model achieves small-lesion mAP of 55.8% and multi-lesion mAP of 63.2%. Conclusions: The proposed YOLOv11-MTB framework demonstrates promising generalization and accuracy for primary bone tumor detection in radiographic images. Its performance in detecting small and multiple lesions suggests potential for clinical application.

Enhancing YOLOv11 with Large Kernel Attention and Multi-Scale Fusion for Accurate Small and Multi-Lesion Bone Tumor Detection in Radiographs

Key Points

Abstract

Cite This Study