Low impact agency: review and discussion

doi:10.48550/arXiv.2303.03139

Low impact agency: review and discussion

Powerful artificial intelligence poses an existential threat if the AI decides to drastically change the world in pursuit of its goals. The hope of low-impact artificial intelligence is to incentivize AI to not do that just because this causes a large impact in the world. In this work, we first review the concept of low-impact agency and previous proposals to approach the problem, and then propose future research directions in the topic, with the goal to ensure low-impactedness is useful in making AI safe.

Publication:

arXiv e-prints

Pub Date:

March 2023

DOI:

10.48550/arXiv.2303.03139

arXiv:

arXiv:2303.03139

Bibcode:

2023arXiv230303139N

Keywords:

Computer Science - Artificial Intelligence;
Computer Science - Computers and Society

E-Print:

Work done as part of the SERIMATS 3.0 training program

ADS

Low impact agency: review and discussion

Abstract