Variational Fusion for Multimodal Sentiment Analysis

doi:10.48550/arXiv.1908.06008

Variational Fusion for Multimodal Sentiment Analysis

Multimodal fusion is considered a key step in multimodal tasks such as sentiment analysis, emotion detection, question answering, and others. Most of the recent work on multimodal fusion does not guarantee the fidelity of the multimodal representation with respect to the unimodal representations. In this paper, we propose a variational autoencoder-based approach for modality fusion that minimizes information loss between unimodal and multimodal representations. We empirically show that this method outperforms the state-of-the-art methods by a significant margin on several popular datasets.

Publication:

arXiv e-prints

Pub Date:

August 2019

DOI:

10.48550/arXiv.1908.06008

arXiv:

arXiv:1908.06008

Bibcode:

2019arXiv190806008M

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence;
Computer Science - Computation and Language;
Statistics - Machine Learning

NASA/ADS

Variational Fusion for Multimodal Sentiment Analysis

Abstract