Conditional Generative Models for Counterfactual Explanations

Counterfactual instances offer human-interpretable insight into the local behaviour of machine learning models. We propose a general framework to generate sparse, in-distribution counterfactual model explanations which match a desired target prediction with a conditional generative model, allowing batches of counterfactual instances to be generated with a single forward pass. The method is flexible with respect to the type of generative model used as well as the task of the underlying predictive model. This allows straightforward application of the framework to different modalities (images, time series or tabular data), generative model paradigms (GANs or autoencoders) and predictive tasks (classification or regression). We illustrate the effectiveness of our method on image (CelebA), time series (ECG) and mixed-type tabular (Adult Census) data.

Publication:

arXiv e-prints

Pub Date:

January 2021

DOI:

10.48550/arXiv.2101.10123

arXiv:

arXiv:2101.10123

Bibcode:

2021arXiv210110123V

Keywords:

Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

12 pages
