A Pilot Study of Domain Adaptation Effect for Neural Abstractive Summarization
Abstract
We study the problem of domain adaptation for neural abstractive summarization. We make initial efforts in investigating what information can be transferred to a new domain. Experimental results on news stories and opinion articles indicate that neural summarization model benefits from pre-training based on extractive summaries. We also find that the combination of in-domain and out-of-domain setup yields better summaries when in-domain data is insufficient. Further analysis shows that, the model is capable to select salient content even trained on out-of-domain data, but requires in-domain data to capture the style for a target domain.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2017
- DOI:
- arXiv:
- arXiv:1707.07062
- Bibcode:
- 2017arXiv170707062H
- Keywords:
-
- Computer Science - Computation and Language
- E-Print:
- This paper is accepted by EMNLP 2017 Workshop on New Frontiers in Summarization