Separating Retention from Extraction in the Evaluation of End-to-end Relation Extraction

doi:10.48550/arXiv.2109.12008

Separating Retention from Extraction in the Evaluation of End-to-end Relation Extraction

State-of-the-art NLP models can adopt shallow heuristics that limit their generalization capability (McCoy et al., 2019). Such heuristics include lexical overlap with the training set in Named-Entity Recognition (Taillé et al., 2020) and Event or Type heuristics in Relation Extraction (Rosenman et al., 2020). In the more realistic end-to-end RE setting, we can expect yet another heuristic: the mere retention of training relation triples. In this paper, we propose several experiments confirming that retention of known facts is a key factor of performance on standard benchmarks. Furthermore, one experiment suggests that a pipeline model able to use intermediate type representations is less prone to over-rely on retention.

Publication:

arXiv e-prints

Pub Date:

September 2021

DOI:

10.48550/arXiv.2109.12008

arXiv:

arXiv:2109.12008

Bibcode:

2021arXiv210912008T

Keywords:

Computer Science - Computation and Language;
Computer Science - Machine Learning

E-Print:

Accepted at EMNLP 2021

NASA/ADS

Separating Retention from Extraction in the Evaluation of End-to-end Relation Extraction

Abstract