Rationale-Enhanced Language Models are Better Continual Relation Learners

doi:10.48550/arXiv.2310.06547

Rationale-Enhanced Language Models are Better Continual Relation Learners

Continual relation extraction (CRE) aims to solve the problem of catastrophic forgetting when learning a sequence of newly emerging relations. Recent CRE studies have found that catastrophic forgetting arises from the model's lack of robustness against future analogous relations. To address the issue, we introduce rationale, i.e., the explanations of relation classification results generated by large language models (LLM), into CRE task. Specifically, we design the multi-task rationale tuning strategy to help the model learn current relations robustly. We also conduct contrastive rationale replay to further distinguish analogous relations. Experimental results on two standard benchmarks demonstrate that our method outperforms the state-of-the-art CRE models.

Publication:

arXiv e-prints

Pub Date:

October 2023

DOI:

10.48550/arXiv.2310.06547

arXiv:

arXiv:2310.06547

Bibcode:

2023arXiv231006547X

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence

E-Print:

Accepted at EMNLP 2023

NASA/ADS

Rationale-Enhanced Language Models are Better Continual Relation Learners

Abstract