What question did this study set out to answer?

This research aims to enhance pose estimation accuracy on blurred images while ensuring privacy protection.

May 31, 2026Open Access

An improved YOLO-Pose model for pose estimation on blurred images generated to protect personal privacy

Puntos clave

This research aims to enhance pose estimation accuracy on blurred images while ensuring privacy protection.
Developed a deconvolution-based upsampling module and custom blurred data augmentation strategy.
Implemented a universal skeleton connection method to adapt to various datasets with different key points.
Trained the model on sharp images and tested it on blurred inputs.
Achieved +4.1% improvement in mAP@50:95 for Gaussian blur, with overall accuracy boosted.
+15.4% improvement in mAP@50:95 for motion blur during testing.
+6.8% improvement in mAP@50:95 for defocus blur under severe degradation.

Resumen

Recent years have witnessed significant advancements in multi-person pose estimation within the You Only Look Once (YOLO) framework. However, human body images are frequently blurred and anonymized to address privacy concerns, which significantly undermines the accuracy and reliability of pose estimation. To overcome these limitations, this article proposes an optimization program for YOLO-Pose, enabled by flexible structural configurations and custom training parameters to enhance adaptability. Specifically, a deconvolution-based upsampling module and a specialized blurred data augmentation strategy are introduced to improve the model’s robustness and generalization. Notably, the proposed model, even when trained exclusively on sharp images, demonstrates superior predictive performance on blurred inputs. Furthermore, we design a universal skeleton connection method that enables YOLO-Pose to seamlessly adapt to datasets with varying numbers of key points, significantly increasing its versatility across diverse annotation standards. Experimental results on the CrowdPose dataset demonstrate the superiority of the proposed method. While maintaining a parameter count nearly identical to that of the self-trained YOLO12n-Pose baseline, our model achieves relative improvements of +4.1%, +15.4%, and +6.8% in mAP@50:95 on test sets corrupted by Gaussian blur, motion blur, and defocus blur respectively, under the most severe degradation levels. The optimized model demonstrates robust and accurate pose estimation directly on blurred input images with varying intensities, highlighting its strong generalization capability under privacy-preserving visual conditions.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo

Cite This Study

Yu et al. (Fri,) studied this question.

synapsesocial.com/papers/6a1bd12d5783ba022b6fcbdf https://doi.org/https://doi.org/10.7717/peerj-cs.3918

Me gusta

Guardar

Ver artículo completo