Multi-Glimpse LSTM with Color-Depth Feature Fusion for Human Detection
Abstract
With the development of depth cameras such as Kinect and Intel Realsense, RGB-D based human detection receives continuous research attention due to its usage in a variety of applications. In this paper, we propose a new Multi-Glimpse LSTM (MG-LSTM) network, in which multi-scale contextual information is sequentially integrated to promote the human detection performance. Furthermore, we propose a feature fusion strategy based on our MG-LSTM network to better incorporate the RGB and depth information. To the best of our knowledge, this is the first attempt to utilize LSTM structure for RGB-D based human detection. Our method achieves superior performance on two publicly available datasets.
- Publication:
-
arXiv e-prints
- Pub Date:
- November 2017
- DOI:
- 10.48550/arXiv.1711.01062
- arXiv:
- arXiv:1711.01062
- Bibcode:
- 2017arXiv171101062L
- Keywords:
-
- Computer Science - Computer Vision and Pattern Recognition
- E-Print:
- ICIP 2017 Oral