Training an adaptive dialogue policy for interactive learning of visually grounded word meanings

doi:10.48550/arXiv.1709.10426

Training an adaptive dialogue policy for interactive learning of visually grounded word meanings

We present a multi-modal dialogue system for interactive learning of perceptually grounded word meanings from a human tutor. The system integrates an incremental, semantic parsing/generation framework - Dynamic Syntax and Type Theory with Records (DS-TTR) - with a set of visual classifiers that are learned throughout the interaction and which ground the meaning representations that it produces. We use this system in interaction with a simulated human tutor to study the effects of different dialogue policies and capabilities on the accuracy of learned meanings, learning rates, and efforts/costs to the tutor. We show that the overall performance of the learning agent is affected by (1) who takes initiative in the dialogues; (2) the ability to express/use their confidence level about visual attributes; and (3) the ability to process elliptical and incrementally constructed dialogue turns. Ultimately, we train an adaptive dialogue policy which optimises the trade-off between classifier accuracy and tutoring costs.

Publication:

arXiv e-prints

Pub Date:

September 2017

DOI:

10.48550/arXiv.1709.10426

arXiv:

arXiv:1709.10426

Bibcode:

2017arXiv170910426Y

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence;
Computer Science - Machine Learning;
Computer Science - Robotics

E-Print:

11 pages, SIGDIAL 2016 Conference

NASA/ADS

Training an adaptive dialogue policy for interactive learning of visually grounded word meanings

Abstract