Minimax Lower Bounds for Linear Independence Testing

doi:10.48550/arXiv.1601.06259

Linear independence testing is a fundamental information-theoretic and statistical problem that can be posed as follows: given $n$ points $\{(X_i,Y_i)\}_{i=1}^n$ from a $(p+q)$-dimensional multivariate distribution, where $X_i \in \mathbb{R}^p$ and $Y_i \in \mathbb{R}^q$, determine whether or not $a^T X$ and $b^T Y$ are uncorrelated for every $a \in \mathbb{R}^p$, $b \in \mathbb{R}^q$. We give a minimax lower bound for this problem in the regime $p+q, n \to \infty$ with $(p+q)/n \leq \kappa < \infty$, without sparsity assumptions. In summary, our results imply that $n$ must be at least as large as $\sqrt{pq}/\|\Sigma_{XY}\|_F^2$ for any procedure (test) to have non-trivial power, where $\Sigma_{XY}$ is the cross-covariance matrix of $X$ and $Y$. We also provide some evidence that the lower bound is tight, through connections to two-sample testing and regression in specific settings.
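
Since $\mathrm{Cov}(a^T X, b^T Y) = a^T \Sigma_{XY} b$, the null hypothesis is equivalent to $\Sigma_{XY} = 0$, so $\|\Sigma_{XY}\|_F^2$ measures the strength of the signal. The following is a minimal Python sketch of the quantities in the abstract, not the paper's procedure: the function names `cross_covariance` and `sample_size_scale` are hypothetical, and `sample_size_scale` returns only the scaling $\sqrt{pq}/\|\Sigma_{XY}\|_F^2$, ignoring any constants that the actual bound may involve.

```python
import numpy as np


def cross_covariance(X, Y):
    """Sample cross-covariance of X (n x p) and Y (n x q); returns a p x q matrix."""
    Xc = X - X.mean(axis=0)  # center each column of X
    Yc = Y - Y.mean(axis=0)  # center each column of Y
    return Xc.T @ Yc / (X.shape[0] - 1)


def sample_size_scale(p, q, frob_sq):
    """Scaling sqrt(p*q) / ||Sigma_XY||_F^2 from the abstract (constants ignored)."""
    return np.sqrt(p * q) / frob_sq


# Illustrative data only: Y depends weakly on the first coordinate of X.
rng = np.random.default_rng(0)
n, p, q = 500, 4, 9
X = rng.standard_normal((n, p))
Y = rng.standard_normal((n, q))
Y[:, 0] += 0.3 * X[:, 0]

S_xy = cross_covariance(X, Y)
frob_sq = np.linalg.norm(S_xy, "fro") ** 2  # plug-in estimate of ||Sigma_XY||_F^2
scale = sample_size_scale(p, q, frob_sq)
```

Note that the plug-in estimate `frob_sq` is biased upward when $p$ and $q$ are large relative to $n$, so it is shown here only to make the quantities concrete.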

Publication:

arXiv e-prints

Pub Date:

January 2016

DOI:

10.48550/arXiv.1601.06259

arXiv:

arXiv:1601.06259

Bibcode:

2016arXiv160106259R

Keywords:

Statistics - Machine Learning;
Computer Science - Information Theory;
Computer Science - Machine Learning;
Mathematics - Statistics Theory

E-Print:

9 pages

NASA/ADS
