Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders

doi:10.48550/arXiv.1703.10960

Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders

While recent neural encoder-decoder models have shown great promise in modeling open-domain conversations, they often generate dull and generic responses. Unlike past work that has focused on diversifying the output of the decoder at word-level to alleviate this problem, we present a novel framework based on conditional variational autoencoders that captures the discourse-level diversity in the encoder. Our model uses latent variables to learn a distribution over potential conversational intents and generates diverse responses using only greedy decoders. We have further developed a novel variant that is integrated with linguistic prior knowledge for better performance. Finally, the training procedure is improved by introducing a bag-of-word loss. Our proposed models have been validated to generate significantly more diverse responses than baseline approaches and exhibit competence in discourse-level decision-making.

Publication:

arXiv e-prints

Pub Date:

March 2017

DOI:

10.48550/arXiv.1703.10960

arXiv:

arXiv:1703.10960

Bibcode:

2017arXiv170310960Z

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence

E-Print:

Appeared in ACL2017 proceedings as a long paper. Correct a calculation mistake in Table 1 E-bow &amp

NASA/ADS

Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders

Abstract