Selective Amnesia: A Continual Learning Approach to Forgetting in Deep Generative Models

doi:10.48550/arXiv.2305.10120

The recent proliferation of large-scale text-to-image models has led to growing concerns that such models may be misused to generate harmful, misleading, and inappropriate content. Motivated by this issue, we derive a technique inspired by continual learning to selectively forget concepts in pretrained deep generative models. Our method, dubbed Selective Amnesia, enables controllable forgetting where a user can specify how a concept should be forgotten. Selective Amnesia can be applied to conditional variational likelihood models, which encompass a variety of popular deep generative frameworks, including variational autoencoders and large-scale text-to-image diffusion models. Experiments across different models demonstrate that our approach induces forgetting on a variety of concepts, from entire classes in standard datasets to celebrity and nudity prompts in text-to-image models. Our code is publicly available at https://github.com/clear-nus/selective-amnesia.

Publication: arXiv e-prints
Pub Date: May 2023
DOI: 10.48550/arXiv.2305.10120
arXiv: arXiv:2305.10120
Bibcode: 2023arXiv230510120H
Keywords: Computer Science - Machine Learning; Computer Science - Artificial Intelligence
