End-to-End Instance Segmentation with Recurrent Attention

doi:10.48550/arXiv.1605.09410

While convolutional neural networks have gained impressive success recently in solving structured prediction problems such as semantic segmentation, it remains a challenge to differentiate individual object instances in the scene. Instance segmentation is very important in a variety of applications, such as autonomous driving, image captioning, and visual question answering. Techniques that combine large graphical models with low-level vision have been proposed to address this problem; however, we propose an end-to-end recurrent neural network (RNN) architecture with an attention mechanism to model a human-like counting process, and produce detailed instance segmentations. The network is jointly trained to sequentially produce regions of interest as well as a dominant object segmentation within each region. The proposed model achieves competitive results on the CVPPP, KITTI, and Cityscapes datasets.

Publication: arXiv e-prints
Pub Date: May 2016
DOI: 10.48550/arXiv.1605.09410
arXiv: arXiv:1605.09410
Bibcode: 2016arXiv160509410R
Keywords: Computer Science - Machine Learning; Computer Science - Computer Vision and Pattern Recognition
E-Print: CVPR 2017
