Poisson PCA for matrix count data
Abstract
We develop a dimension reduction framework for data consisting of matrices of counts. Our model is based on the assumption of existence of a small amount of independent normal latent variables that drive the dependency structure of the observed data, and can be seen as the exact discrete analogue of a contaminated low-rank matrix normal model. We derive estimators for the model parameters and establish their limiting normality. An extension of a recent proposal from the literature is used to estimate the latent dimension of the model. The method is shown to outperform both its vectorization-based competitors and matrix methods assuming the continuity of the data distribution in analysing simulated data and real world abundance data.
- Publication:
-
Pattern Recognition
- Pub Date:
- June 2023
- DOI:
- arXiv:
- arXiv:2110.14420
- Bibcode:
- 2023PatRe.13809401V
- Keywords:
-
- Discrete data;
- Kronecker model;
- Matrix normal distribution;
- Poisson log-normal distribution;
- Mathematics - Statistics Theory
- E-Print:
- 19 pages, 7 figures