Sequence Level Contrastive Learning for Text Summarization

doi:10.48550/arXiv.2109.03481

Sequence Level Contrastive Learning for Text Summarization

Contrastive learning models have achieved great success in unsupervised visual representation learning, which maximize the similarities between feature representations of different views of the same image, while minimize the similarities between feature representations of views of different images. In text summarization, the output summary is a shorter form of the input document and they have similar meanings. In this paper, we propose a contrastive learning model for supervised abstractive text summarization, where we view a document, its gold summary and its model generated summaries as different views of the same mean representation and maximize the similarities between them during training. We improve over a strong sequence-to-sequence text generation model (i.e., BART) on three different summarization datasets. Human evaluation also shows that our model achieves better faithfulness ratings compared to its counterpart without contrastive objectives.

Publication:

arXiv e-prints

Pub Date:

September 2021

DOI:

10.48550/arXiv.2109.03481

arXiv:

arXiv:2109.03481

Bibcode:

2021arXiv210903481X

Keywords:

Computer Science - Computation and Language

E-Print:

2 figures, 12 tables

NASA/ADS

Sequence Level Contrastive Learning for Text Summarization

Abstract