A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference

doi:10.48550/arXiv.2204.05428

Most evaluations of attribution methods focus on the English language. In this work, we present a multilingual approach for evaluating attribution methods for the Natural Language Inference (NLI) task in terms of faithfulness and plausibility. First, we introduce a novel cross-lingual strategy to measure faithfulness based on word alignments, which eliminates the drawbacks of erasure-based evaluations. We then perform a comprehensive evaluation of attribution methods, considering different output mechanisms and aggregation methods. Finally, we augment the XNLI dataset with highlight-based explanations, providing a multilingual NLI dataset with highlights, to support future exNLP studies. Our results show that the attribution methods that perform best for plausibility differ from those that perform best for faithfulness.

Publication:

arXiv e-prints

Pub Date:

April 2022

DOI:

10.48550/arXiv.2204.05428

arXiv:

arXiv:2204.05428

Bibcode:

2022arXiv220405428Z

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence

E-Print:

21 pages, 7 figures. Code and data at https://keremzaman.com/explaiNLI/
