Volumetric performance capture from minimal camera viewpoints

doi:10.48550/arXiv.1807.01950

Volumetric performance capture from minimal camera viewpoints

We present a convolutional autoencoder that enables high fidelity volumetric reconstructions of human performance to be captured from multi-view video comprising only a small set of camera views. Our method yields similar end-to-end reconstruction error to that of a probabilistic visual hull computed using significantly more (double or more) viewpoints. We use a deep prior implicitly learned by the autoencoder trained over a dataset of view-ablated multi-view video footage of a wide range of subjects and actions. This opens up the possibility of high-end volumetric performance capture in on-set and prosumer scenarios where time or cost prohibit a high witness camera count.

Publication:

arXiv e-prints

Pub Date:

July 2018

DOI:

10.48550/arXiv.1807.01950

arXiv:

arXiv:1807.01950

Bibcode:

2018arXiv180701950G

Keywords:

Computer Science - Computer Vision and Pattern Recognition

NASA/ADS

Volumetric performance capture from minimal camera viewpoints

Abstract