Transforming Question Answering Datasets Into Natural Language Inference Datasets

doi:10.48550/arXiv.1809.02922

Transforming Question Answering Datasets Into Natural Language Inference Datasets

Existing datasets for natural language inference (NLI) have propelled research on language understanding. We propose a new method for automatically deriving NLI datasets from the growing abundance of large-scale question answering datasets. Our approach hinges on learning a sentence transformation model which converts question-answer pairs into their declarative forms. Despite being primarily trained on a single QA dataset, we show that it can be successfully applied to a variety of other QA resources. Using this system, we automatically derive a new freely available dataset of over 500k NLI examples (QA-NLI), and show that it exhibits a wide range of inference phenomena rarely seen in previous NLI datasets.

Publication:

arXiv e-prints

Pub Date:

September 2018

DOI:

10.48550/arXiv.1809.02922

arXiv:

arXiv:1809.02922

Bibcode:

2018arXiv180902922D

Keywords:

Computer Science - Computation and Language

E-Print:

11 pages, 6 figures

NASA/ADS

Transforming Question Answering Datasets Into Natural Language Inference Datasets

Abstract