Ray-ONet: Efficient 3D Reconstruction From A Single RGB Image
Abstract
We propose Ray-ONet to reconstruct detailed 3D models from monocular images efficiently. By predicting a series of occupancy probabilities along a ray that is back-projected from a pixel in the camera coordinate, our method Ray-ONet improves the reconstruction accuracy in comparison with Occupancy Networks (ONet), while reducing the network inference complexity to O($N^2$). As a result, Ray-ONet achieves state-of-the-art performance on the ShapeNet benchmark with more than 20$\times$ speed-up at $128^3$ resolution and maintains a similar memory footprint during inference.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2021
- DOI:
- 10.48550/arXiv.2107.01899
- arXiv:
- arXiv:2107.01899
- Bibcode:
- 2021arXiv210701899B
- Keywords:
-
- Computer Science - Computer Vision and Pattern Recognition
- E-Print:
- accepted in BMVC 2021