March 18, 2024Open Access

3D Pose Estimation from Monocular Video with Camera-Bone Angle Regularization on the Image Feature

Key Points

Key points are not available for this paper at this time.

Abstract

In this paper, we propose a monocular 3D pose estimation method which explicitly takes into account the angles between the camera optical axis and bones (camera-bone angles) as well as temporal information. The proposed method combines a 2D-to-3D-based method, which predicts a 3D pose from a sequence of 2D poses, and convolutional neural network (CNN) and includes novel regularization loss to enable the CNN to extract camera-bone-angle information. The camera-bone-angle and temporal information suppress ambiguity of 2D-to-3D-based methods where the same 2D pose can be mapped to multiple 3D poses. Experiments on the Human3.6M and MPI-INF-3DHP datasets showed that the proposed method improved the performance by 5.1 mm and 2.1 mm in terms of mean per joint position error (MPJPE) respectively.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Ishii et al. (Mon,) studied this question.

synapsesocial.com/papers/68e7398bb6db6435876b2c8f https://doi.org/https://doi.org/10.1109/icassp48485.2024.10446350

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Bookmark

View Full Paper