Generative Adversarial Networks for Synthetic Data Generation: A Comparative Study
Abstract
Generative Adversarial Networks (GANs) are gaining increasing attention as a means for synthesising data. So far much of this work has been applied to use cases outside of the data confidentiality domain with a common application being the production of artificial images. Here we consider the potential application of GANs for the purpose of generating synthetic census microdata. We employ a battery of utility metrics and a disclosure risk metric (the Targeted Correct Attribution Probability) to compare the data produced by tabular GANs with those produced using orthodox data synthesis methods.
- Publication:
-
arXiv e-prints
- Pub Date:
- December 2021
- DOI:
- 10.48550/arXiv.2112.01925
- arXiv:
- arXiv:2112.01925
- Bibcode:
- 2021arXiv211201925L
- Keywords:
-
- Computer Science - Machine Learning