Confidence-ranked reconstruction of census microdata from published statistics
Abstract
We show how to launch a reconstruction attack on US Decennial Census data based only on publicly released statistics. The attack can recover individual microdata—i.e., the responses of individual Census survey respondents. Although our attack cannot reconstruct all rows of the private data, it can produce a confidence-based ranking of rows, such that rows that appear earlier in the ranking are more likely to appear in the private data. Thus, the attacker can reconstruct a fraction of the rows with confidence. We compare our attack to a hierarchy of increasingly strong baselines and show that we can outperform all of them. Our results point to the necessity of employing privacy-enhancing technologies when releasing statistics of privacy-sensitive datasets.
- Publication:
-
Proceedings of the National Academy of Science
- Pub Date:
- February 2023
- DOI:
- arXiv:
- arXiv:2211.03128
- Bibcode:
- 2023PNAS..12018605D
- Keywords:
-
- Computer Science - Computers and Society;
- Computer Science - Cryptography and Security;
- Computer Science - Machine Learning
- E-Print:
- doi:10.1073/pnas.2218605120