Multimodal sensor fusion in the latent representation space
Abstract
A new method for multimodal sensor fusion is introduced. The technique relies on a two-stage process. In the first stage, a multimodal generative model is constructed from unlabelled training data. In the second stage, the generative model serves as a reconstruction prior and the search manifold for the sensor fusion tasks. The method also handles cases where observations are accessed only via subsampling i.e. compressed sensing. We demonstrate the effectiveness and excellent performance on a range of multimodal fusion experiments such as multisensory classification, denoising, and recovery from subsampled observations.
- Publication:
-
arXiv e-prints
- Pub Date:
- August 2022
- DOI:
- 10.48550/arXiv.2208.02183
- arXiv:
- arXiv:2208.02183
- Bibcode:
- 2022arXiv220802183P
- Keywords:
-
- Computer Science - Artificial Intelligence;
- Computer Science - Human-Computer Interaction;
- Computer Science - Machine Learning;
- Electrical Engineering and Systems Science - Signal Processing;
- 68Txx;
- I.2.4;
- H.1.2;
- I.2.6
- E-Print:
- Under review for Nature Scientific Reports