Differentially Private Algorithms for 2020 Census Detailed DHC Race \& Ethnicity
Abstract
This article describes a proposed differentially private (DP) algorithms that the US Census Bureau is considering to release the Detailed Demographic and Housing Characteristics (DHC) Race & Ethnicity tabulations as part of the 2020 Census. The tabulations contain statistics (counts) of demographic and housing characteristics of the entire population of the US crossed with detailed races and tribes at varying levels of geography. We describe two differentially private algorithmic strategies, one based on adding noise drawn from a two-sided Geometric distribution that satisfies "pure"-DP, and another based on adding noise from a Discrete Gaussian distribution that satisfied a well studied variant of differential privacy, called Zero Concentrated Differential Privacy (zCDP). We analytically estimate the privacy loss parameters ensured by the two algorithms for comparable levels of error introduced in the statistics.
- Publication:
-
arXiv e-prints
- Pub Date:
- July 2021
- DOI:
- 10.48550/arXiv.2107.10659
- arXiv:
- arXiv:2107.10659
- Bibcode:
- 2021arXiv210710659H
- Keywords:
-
- Computer Science - Cryptography and Security;
- Computer Science - Databases;
- Statistics - Applications
- E-Print:
- Presented at Theory and Practice of Differential Privacy Workshop (TPDP) 2021