Learning to Recombine and Resample Data for Compositional Generalization

doi:10.48550/arXiv.2010.03706

Learning to Recombine and Resample Data for Compositional Generalization

Flexible neural sequence models outperform grammar- and automaton-based counterparts on a variety of tasks. However, neural models perform poorly in settings requiring compositional generalization beyond the training data -- particularly to rare or unseen subsequences. Past work has found symbolic scaffolding (e.g. grammars or automata) essential in these settings. We describe R&R, a learned data augmentation scheme that enables a large category of compositional generalizations without appeal to latent symbolic structure. R&R has two components: recombination of original training examples via a prototype-based generative model and resampling of generated examples to encourage extrapolation. Training an ordinary neural sequence model on a dataset augmented with recombined and resampled examples significantly improves generalization in two language processing problems -- instruction following (SCAN) and morphological analysis (SIGMORPHON 2018) -- where R&R enables learning of new constructions and tenses from as few as eight initial examples.

Publication:

arXiv e-prints

Pub Date:

October 2020

DOI:

10.48550/arXiv.2010.03706

arXiv:

arXiv:2010.03706

Bibcode:

2020arXiv201003706A

Keywords:

Computer Science - Computation and Language;
Computer Science - Machine Learning

E-Print:

ICLR2021

NASA/ADS

Learning to Recombine and Resample Data for Compositional Generalization

Abstract