Blameworthiness in Multi-Agent Settings

doi:10.48550/arXiv.1903.04102

Blameworthiness in Multi-Agent Settings

We provide a formal definition of blameworthiness in settings where multiple agents can collaborate to avoid a negative outcome. We first provide a method for ascribing blameworthiness to groups relative to an epistemic state (a distribution over causal models that describe how the outcome might arise). We then show how we can go from an ascription of blameworthiness for groups to an ascription of blameworthiness for individuals using a standard notion from cooperative game theory, the Shapley value. We believe that getting a good notion of blameworthiness in a group setting will be critical for designing autonomous agents that behave in a moral manner.

Publication:

arXiv e-prints

Pub Date:

March 2019

DOI:

10.48550/arXiv.1903.04102

arXiv:

arXiv:1903.04102

Bibcode:

2019arXiv190304102F

Keywords:

Computer Science - Computers and Society;
Computer Science - Artificial Intelligence;
Computer Science - Multiagent Systems

E-Print:

Appears in AAAI-19

NASA/ADS

Blameworthiness in Multi-Agent Settings

Abstract