Blameworthiness in Multi-Agent Settings
Abstract
We provide a formal definition of blameworthiness in settings where multiple agents can collaborate to avoid a negative outcome. We first provide a method for ascribing blameworthiness to groups relative to an epistemic state (a distribution over causal models that describe how the outcome might arise). We then show how we can go from an ascription of blameworthiness for groups to an ascription of blameworthiness for individuals using a standard notion from cooperative game theory, the Shapley value. We believe that getting a good notion of blameworthiness in a group setting will be critical for designing autonomous agents that behave in a moral manner.
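The step from group-level to individual-level blameworthiness can be illustrated with a minimal sketch of the Shapley value. The paper's own definition of group blameworthiness is not reproduced in this record, so `group_blame` and `toy_group_blame` are hypothetical stand-ins with made-up numbers; only the Shapley formula itself is standard.

```python
from itertools import combinations
from math import factorial


def shapley_values(agents, group_blame):
    """Split a group-level score among individuals via the Shapley value.

    agents: list of hashable agent names.
    group_blame: hypothetical function mapping a frozenset of agents to a
        number (a stand-in for a group blameworthiness score).
    """
    n = len(agents)
    values = {}
    for agent in agents:
        others = [a for a in agents if a != agent]
        total = 0.0
        for size in range(n):
            # Standard Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                coalition = frozenset(subset)
                # Marginal contribution of `agent` when joining `coalition`.
                marginal = group_blame(coalition | {agent}) - group_blame(coalition)
                total += weight * marginal
        values[agent] = total
    return values


# Toy assumption (not from the paper): a group carries blame 1.0 only if it
# has at least two members, i.e. any two agents together could have avoided
# the outcome.
def toy_group_blame(coalition):
    return 1.0 if len(coalition) >= 2 else 0.0


result = shapley_values(["a", "b", "c"], toy_group_blame)
print(result)
```

In this symmetric toy case each agent receives one third of the full group's score, and the individual shares sum to the score of the full group, which is the efficiency property of the Shapley value.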
- Publication: arXiv e-prints
- Pub Date: March 2019
- DOI: 10.48550/arXiv.1903.04102
- arXiv: arXiv:1903.04102
- Bibcode: 2019arXiv190304102F
- Keywords: Computer Science - Computers and Society; Computer Science - Artificial Intelligence; Computer Science - Multiagent Systems
- E-Print: Appears in AAAI-19