A Neighbourhood-Aware Differential Privacy Mechanism for Static Word Embeddings

doi:10.48550/arXiv.2309.10551

A Neighbourhood-Aware Differential Privacy Mechanism for Static Word Embeddings

We propose a Neighbourhood-Aware Differential Privacy (NADP) mechanism considering the neighbourhood of a word in a pretrained static word embedding space to determine the minimal amount of noise required to guarantee a specified privacy level. We first construct a nearest neighbour graph over the words using their embeddings, and factorise it into a set of connected components (i.e. neighbourhoods). We then separately apply different levels of Gaussian noise to the words in each neighbourhood, determined by the set of words in that neighbourhood. Experiments show that our proposed NADP mechanism consistently outperforms multiple previously proposed DP mechanisms such as Laplacian, Gaussian, and Mahalanobis in multiple downstream tasks, while guaranteeing higher levels of privacy.

Publication:

arXiv e-prints

Pub Date:

September 2023

DOI:

10.48550/arXiv.2309.10551

arXiv:

arXiv:2309.10551

Bibcode:

2023arXiv230910551B

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence;
Computer Science - Computation and Language;
Computer Science - Cryptography and Security

E-Print:

Accepted to IJCNLP-AACL 2023

NASA/ADS

A Neighbourhood-Aware Differential Privacy Mechanism for Static Word Embeddings

Abstract