R-CNNs for Pose Estimation and Action Detection

doi:10.48550/arXiv.1406.5212

R-CNNs for Pose Estimation and Action Detection

We present convolutional neural networks for the tasks of keypoint (pose) prediction and action classification of people in unconstrained images. Our approach involves training an R-CNN detector with loss functions depending on the task being tackled. We evaluate our method on the challenging PASCAL VOC dataset and compare it to previous leading approaches. Our method gives state-of-the-art results for keypoint and action prediction. Additionally, we introduce a new dataset for action detection, the task of simultaneously localizing people and classifying their actions, and present results using our approach.

Publication:

arXiv e-prints

Pub Date:

June 2014

DOI:

10.48550/arXiv.1406.5212

arXiv:

arXiv:1406.5212

Bibcode:

2014arXiv1406.5212G

Keywords:

Computer Science - Computer Vision and Pattern Recognition

NASA/ADS

R-CNNs for Pose Estimation and Action Detection

Abstract