Low impact agency: review and discussion
Abstract
Powerful artificial intelligence poses an existential threat if the AI decides to drastically change the world in pursuit of its goals. The hope of low-impact artificial intelligence is to incentivize AI to not do that just because this causes a large impact in the world. In this work, we first review the concept of low-impact agency and previous proposals to approach the problem, and then propose future research directions in the topic, with the goal to ensure low-impactedness is useful in making AI safe.
- Publication:
-
arXiv e-prints
- Pub Date:
- March 2023
- DOI:
- arXiv:
- arXiv:2303.03139
- Bibcode:
- 2023arXiv230303139N
- Keywords:
-
- Computer Science - Artificial Intelligence;
- Computer Science - Computers and Society
- E-Print:
- Work done as part of the SERIMATS 3.0 training program