On Extractive and Abstractive Neural Document Summarization with Transformer Language Models

doi:10.48550/arXiv.1909.03186

We present a method to produce abstractive summaries of long documents that exceed several thousand words via neural abstractive summarization. We perform a simple extractive step before generating a summary, which is then used to condition the transformer language model on relevant information before being tasked with generating a summary. We show that this extractive step significantly improves summarization results. We also show that this approach produces more abstractive summaries compared to prior work that employs a copy mechanism while still achieving higher rouge scores. Note: The abstract above was not written by the authors; it was generated by one of the models presented in this paper.
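The two-stage idea in the abstract (extract first, then condition a language model on the extracted text) can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the abstract does not specify the extractive model, so a simple word-frequency sentence scorer stands in for it, and the function names and the `<summary>` separator are hypothetical rather than taken from the paper. No language model is called; the sketch only builds the text a model would be conditioned on.

```python
import re
from collections import Counter


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def extract_top_sentences(document, k=2):
    """Stand-in extractive step (not the paper's extractor): keep the k
    sentences with the highest average word frequency, in original order."""
    sentences = split_sentences(document)
    freq = Counter(re.findall(r"[a-z']+", document.lower()))

    def score(sentence):
        tokens = re.findall(r"[a-z']+", sentence.lower())
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    # Rank sentence indices by score, then restore document order.
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:k])]


def build_conditioning_text(document, k=2, separator="\n<summary>\n"):
    """Join the extracted sentences and append a hypothetical separator;
    a language model would continue from here to write the summary."""
    return " ".join(extract_top_sentences(document, k)) + separator
```

For a document of several thousand words, `build_conditioning_text(document, k=...)` would shorten the input to a few salient sentences before generation, which is the role the abstract assigns to the extractive step.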

Publication: arXiv e-prints
Pub Date: September 2019
DOI: 10.48550/arXiv.1909.03186
arXiv: arXiv:1909.03186
Bibcode: 2019arXiv190903186S
Keywords: Computer Science - Computation and Language
