End-to-End Human Pose and Mesh Reconstruction with Transformers

Abstract

We present a new method, called MEsh TRansfOrmer (METRO), to reconstruct 3D human pose and mesh vertices from a single image. Our method uses a transformer encoder to jointly model vertex-vertex and vertex-joint interactions, and outputs 3D joint coordinates and mesh vertices simultaneously. Unlike existing techniques that regress pose and shape parameters, METRO does not rely on any parametric mesh model such as SMPL, so it can be easily extended to other objects such as hands. We further relax the mesh topology and allow the transformer self-attention mechanism to attend freely between any two vertices, making it possible to learn non-local relationships among mesh vertices and joints. With the proposed masked vertex modeling, our method is more robust and effective in handling challenging situations such as partial occlusions. METRO achieves new state-of-the-art results for human mesh reconstruction on the public Human3.6M and 3DPW datasets. Moreover, we demonstrate the generalizability of METRO to 3D hand reconstruction in the wild, outperforming existing state-of-the-art methods on the FreiHAND dataset.
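
To make the two ideas in the abstract concrete (attention between any pair of joint and vertex queries, and masking some queries during training), here is a toy sketch in plain Python. It is not the authors' implementation. The token sizes, the identity projections in place of learned weights, the zero vector used as a mask token, and the helper names `self_attention` and `mask_queries` are all assumptions made for illustration.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: identity query/key/value projections (no learned weights),
    so only the attention pattern itself is illustrated.
    """
    d = len(tokens[0])
    outputs, weights = [], []
    for q in tokens:
        # Every token scores against every other token: no mesh-topology
        # restriction, so any joint or vertex can attend to any other.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        w = softmax(scores)
        weights.append(w)
        outputs.append(
            [sum(w[j] * tokens[j][c] for j in range(len(tokens))) for c in range(d)]
        )
    return outputs, weights


def mask_queries(tokens, ratio, rng):
    """Replace a random subset of tokens with a zero vector.

    Assumption: a zero vector stands in for the mask token.
    """
    n_mask = int(len(tokens) * ratio)
    masked_idx = set(rng.sample(range(len(tokens)), n_mask))
    d = len(tokens[0])
    masked = [[0.0] * d if i in masked_idx else list(t) for i, t in enumerate(tokens)]
    return masked, masked_idx


rng = random.Random(0)
num_joints, num_vertices, dim = 3, 5, 4  # hypothetical toy sizes
# Joint queries and vertex queries share one sequence.
queries = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_joints + num_vertices)]

masked, masked_idx = mask_queries(queries, ratio=0.25, rng=rng)
out, weights = self_attention(masked)
```

In this sketch each row of `weights` is a distribution over all joint and vertex tokens, so a masked token's output is built entirely from the tokens it attends to. That is the intuition behind using masking to encourage robustness to missing or occluded evidence.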

Authors

Kevin Lin

The University of Texas MD Anderson Cancer Center

Lijuan Wang

University of Kent

Zicheng Liu

Guizhou University of Finance and Economics

Institutions

Microsoft Research (United Kingdom)

Cite this study

Lin et al. (Tue,) studied this question.

synapsesocial.com/papers/6a10df96f85e2d3f759f6cbb — DOI: https://doi.org/10.1109/cvpr46437.2021.00199
