Generalization in Generation: A closer look at Exposure Bias

doi:10.48550/arXiv.1910.00292

Schmidt, Florian

Exposure bias refers to the train-test discrepancy that seemingly arises when an autoregressive generative model uses only ground-truth contexts at training time but generated ones at test time. We separate the contributions of the model and the learning framework to clarify the debate on consequences and review proposed counter-measures. In this light, we argue that generalization is the underlying property to address and propose unconditional generation as its fundamental benchmark. Finally, we combine latent variable modeling with a recent formulation of exploration in reinforcement learning to obtain a rigorous handling of true and generated contexts. Results on language modeling and variational sentence auto-encoding confirm the model's generalization capability.

Publication:

arXiv e-prints

Pub Date:

October 2019

DOI:

10.48550/arXiv.1910.00292

arXiv:

arXiv:1910.00292

Bibcode:

2019arXiv191000292S

Keywords:

Computer Science - Machine Learning;
Computer Science - Computation and Language;
Statistics - Machine Learning

E-Print:

WNGT 2019 camera-ready
