Linear Regression with an Unknown Permutation: Statistical and Computational Limits
Abstract
Consider a noisy linear observation model with an unknown permutation, based on observing $y = \Pi^* A x^* + w$, where $x^* \in \mathbb{R}^d$ is an unknown vector, $\Pi^*$ is an unknown $n \times n$ permutation matrix, and $w \in \mathbb{R}^n$ is additive Gaussian noise. We analyze the problem of permutation recovery in a random design setting in which the entries of the matrix $A$ are drawn i.i.d. from a standard Gaussian distribution, and establish sharp conditions on the SNR, sample size $n$, and dimension $d$ under which $\Pi^*$ is exactly and approximately recoverable. On the computational front, we show that the maximum likelihood estimate of $\Pi^*$ is NP-hard to compute, while also providing a polynomial time algorithm when $d =1$.
- Publication:
-
arXiv e-prints
- Pub Date:
- August 2016
- DOI:
- 10.48550/arXiv.1608.02902
- arXiv:
- arXiv:1608.02902
- Bibcode:
- 2016arXiv160802902P
- Keywords:
-
- Mathematics - Statistics Theory;
- Computer Science - Information Theory;
- Statistics - Machine Learning
- E-Print:
- To appear in part at the 2016 Allerton Conference on Control, Communication and Computing