Explaining high-dimensional text classifiers

Explainability has become a valuable tool in the last few years, helping humans better understand AI-guided decisions. However, classic explainability tools are often limited when applied to high-dimensional inputs and neural network classifiers. We present a new explainability method that uses theoretically proven high-dimensional properties of neural network classifiers. We demonstrate it on two tasks: (1) the classical sentiment analysis task on the IMDB reviews dataset, and (2) our malware-detection task on our PowerShell scripts dataset.

Publication:

arXiv e-prints

Pub Date:

November 2023

DOI:

10.48550/arXiv.2311.13454

arXiv:

arXiv:2311.13454

Bibcode:

2023arXiv231113454M

Keywords:

Computer Science - Machine Learning;
Computer Science - Cryptography and Security;
Computer Science - Neural and Evolutionary Computing;
Statistics - Machine Learning

E-Print:

Accepted to "XAI in Action" workshop @ NeurIPS 2023
