Graph Stacked Hourglass Networks for 3D Human Pose Estimation

doi:10.48550/arXiv.2103.16385

Graph Stacked Hourglass Networks for 3D Human Pose Estimation

In this paper, we propose a novel graph convolutional network architecture, Graph Stacked Hourglass Networks, for 2D-to-3D human pose estimation tasks. The proposed architecture consists of repeated encoder-decoder, in which graph-structured features are processed across three different scales of human skeletal representations. This multi-scale architecture enables the model to learn both local and global feature representations, which are critical for 3D human pose estimation. We also introduce a multi-level feature learning approach using different-depth intermediate features and show the performance improvements that result from exploiting multi-scale, multi-level feature representations. Extensive experiments are conducted to validate our approach, and the results show that our model outperforms the state-of-the-art.

Publication:

arXiv e-prints

Pub Date:

March 2021

DOI:

10.48550/arXiv.2103.16385

arXiv:

arXiv:2103.16385

Bibcode:

2021arXiv210316385X

Keywords:

Computer Science - Computer Vision and Pattern Recognition

E-Print:

Accepted to CVPR 2021

NASA/ADS

Graph Stacked Hourglass Networks for 3D Human Pose Estimation

Abstract