Dual Task Framework for Improving Persona-grounded Dialogue Dataset

doi:10.48550/arXiv.2202.05435

This paper introduces a simple yet effective data-centric approach to improving persona-conditioned dialogue agents. Prior model-centric approaches depend unquestioningly on raw crowdsourced benchmark datasets such as Persona-Chat. In contrast, we aim to fix annotation artifacts in the benchmark itself, a step that is orthogonal to, and applicable with, any dialogue model. Specifically, we augment the dataset with relevant personas to improve both the dataset and the agent, leveraging the primal-dual structure of two tasks: predicting dialogue responses from personas and predicting personas from dialogue responses. Experiments on Persona-Chat show that our approach outperforms pre-trained LMs by 11.7 points in accuracy.

Publication:

arXiv e-prints

Pub Date:

February 2022

DOI:

10.48550/arXiv.2202.05435

arXiv:

arXiv:2202.05435

Bibcode:

2022arXiv220205435K

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence;
Computer Science - Machine Learning

E-Print:

Accepted to AAAI 2022
