Unsupervised Data Imputation via Variational Inference of Deep Subspaces

doi:10.48550/arXiv.1903.03503

Unsupervised Data Imputation via Variational Inference of Deep Subspaces

A wide range of systems exhibit high dimensional incomplete data. Accurate estimation of the missing data is often desired, and is crucial for many downstream analyses. Many state-of-the-art recovery methods involve supervised learning using datasets containing full observations. In contrast, we focus on unsupervised estimation of missing image data, where no full observations are available - a common situation in practice. Unsupervised imputation methods for images often employ a simple linear subspace to capture correlations between data dimensions, omitting more complex relationships. In this work, we introduce a general probabilistic model that describes sparse high dimensional imaging data as being generated by a deep non-linear embedding. We derive a learning algorithm using a variational approximation based on convolutional neural networks and discuss its relationship to linear imputation models, the variational auto encoder, and deep image priors. We introduce sparsity-aware network building blocks that explicitly model observed and missing data. We analyze proposed sparsity-aware network building blocks, evaluate our method on public domain imaging datasets, and conclude by showing that our method enables imputation in an important real-world problem involving medical images. The code is freely available as part of the \verb|neuron| library at http://github.com/adalca/neuron.

Publication:

arXiv e-prints

Pub Date:

March 2019

DOI:

10.48550/arXiv.1903.03503

arXiv:

arXiv:1903.03503

Bibcode:

2019arXiv190303503D

Keywords:

Computer Science - Computer Vision and Pattern Recognition;
Computer Science - Machine Learning

NASA/ADS

Unsupervised Data Imputation via Variational Inference of Deep Subspaces

Abstract