EdgeNet: Semantic Scene Completion from a Single RGB-D Image
Abstract
Semantic scene completion is the task of predicting a complete 3D representation of volumetric occupancy, with corresponding semantic labels, for a scene observed from a single point of view. Previous works on semantic scene completion from RGB-D data used either depth alone or depth with colour, projecting the 2D image into the 3D volume, which results in a sparse data representation. In this work, we present a new strategy to encode colour information in 3D space using edge detection and flipped truncated signed distance. We also present EdgeNet, a new end-to-end neural network architecture capable of handling features generated from the fusion of depth and edge information. Experimental results show an improvement of 6.9% over the state-of-the-art result on real data, for end-to-end approaches.
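The encoding idea mentioned in the abstract (edges projected into a voxel grid, then turned into a flipped truncated distance volume) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `flipped_truncated_distance`, the `truncation` parameter, the brute-force nearest-neighbour search, and the omission of the sign term are all assumptions made for illustration. It assumes the edge pixels have already been detected in the RGB image and projected into a boolean voxel grid.

```python
import numpy as np

def flipped_truncated_distance(edge_voxels, truncation=3.0):
    """Hypothetical helper: unsigned flipped truncated distance to edge voxels.

    edge_voxels: boolean 3D array, True where a projected image edge falls.
    truncation:  distance (in voxels) beyond which the value saturates at 0.
    Returns a float32 volume with 1.0 on edge voxels, decaying linearly to 0.0
    at `truncation` voxels away. The sign term of a true signed distance is
    left out of this sketch.
    """
    edge_voxels = np.asarray(edge_voxels, dtype=bool)
    edge_coords = np.argwhere(edge_voxels)            # (K, 3) edge positions
    if edge_coords.size == 0:
        # No edges at all: every voxel is "far", so the flipped value is 0.
        return np.zeros(edge_voxels.shape, dtype=np.float32)

    # Coordinates of every voxel, flattened in C order: (N, 3).
    grid = np.indices(edge_voxels.shape).reshape(3, -1).T

    # Brute-force Euclidean distance to the nearest edge voxel.
    # Fine for small grids; a distance transform would be used at scale.
    diffs = grid[:, None, :] - edge_coords[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=-1)).min(axis=1)

    # Truncate, then flip so the strongest response sits on the edges.
    dist = np.minimum(dist, truncation)
    flipped = 1.0 - dist / truncation
    return flipped.reshape(edge_voxels.shape).astype(np.float32)


if __name__ == "__main__":
    volume = np.zeros((5, 5, 5), dtype=bool)
    volume[2, 2, 2] = True                            # a single edge voxel
    encoded = flipped_truncated_distance(volume, truncation=3.0)
    print(encoded[2, 2, 2], encoded[2, 2, 3], encoded[0, 0, 0])
```

Flipping the distance concentrates large values near the edges rather than in empty space, so the dense volume carries the colour-derived signal instead of the sparse projected voxels alone.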
- Publication: arXiv e-prints
- Pub Date: August 2019
- DOI: 10.48550/arXiv.1908.02893
- arXiv: arXiv:1908.02893
- Bibcode: 2019arXiv190802893D
- Keywords: Computer Science - Computer Vision and Pattern Recognition; I.4.6; I.4.8
- E-Print: 10 pages, 5 figures. Accepted at ICPR 2020