Efficient Bayesian Additive Regression Models For Microbiome Studies
Abstract
Statistical analysis of microbiome data is challenging. Bayesian multinomial logistic-normal (MLN) models have gained popularity due to their ability to account for the count compositional nature of these data. However, inference for these models is often computationally intractable. Recently, we developed a computationally efficient and accurate approach to inferring MLN models with a Marginally Latent Matrix-T Process (MLTP) form: MLN-MLTPs. Our approach is based on a novel sampler with a marginal Laplace approximation, called the Collapse-Uncollapse (CU) sampler. However, existing work with MLTPs has been limited to linear models or models of a single non-linear process. Moreover, existing methods lack an efficient means of estimating model hyperparameters. This article addresses both deficiencies. We introduce a new class of MLN Additive Gaussian Process models (MultiAddGPs) for deconvolution of overlapping linear and non-linear processes. We show that MultiAddGPs are examples of MLN-MLTPs and derive an efficient CU sampler for this model class. Moreover, we derive efficient Maximum Marginal Likelihood estimation for hyperparameters in MLTP models by taking advantage of the Laplace approximation in the CU sampler. We demonstrate our approach using simulated and real data studies. Our models produce novel biological insights from a previously published artificial gut study.
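To make the idea of overlapping linear and non-linear processes concrete, the following is a minimal forward-simulation sketch in Python. It is not the authors' model specification or their CU sampler: the softmax link, the squared-exponential kernel, the binary covariate, and every name and default value (`rbf_kernel`, `simulate_additive_mln`, `depth`, `length_scale`) are assumptions chosen for illustration only. It draws a latent log-abundance matrix as the sum of a linear covariate effect and a Gaussian process over time, then samples multinomial counts from the resulting compositions.

```python
import numpy as np


def rbf_kernel(t, length_scale=2.0, variance=1.0):
    """Squared-exponential kernel over time points t (illustrative choice)."""
    d = t[:, None] - t[None, :]
    return variance * np.exp(-0.5 * (d / length_scale) ** 2)


def simulate_additive_mln(n_samples=30, n_taxa=4, depth=1000, seed=0):
    """Simulate counts from a hypothetical additive logistic-normal model."""
    rng = np.random.default_rng(seed)

    # Observation times and a binary covariate (both assumed for illustration).
    t = np.linspace(0.0, 10.0, n_samples)
    x = rng.integers(0, 2, size=n_samples).astype(float)

    # Linear component: one coefficient per taxon times the covariate.
    beta = rng.normal(size=(n_taxa, 1))
    linear = beta @ x[None, :]  # shape (n_taxa, n_samples)

    # Non-linear component: an independent GP draw over time for each taxon.
    K = rbf_kernel(t) + 1e-6 * np.eye(n_samples)  # jitter for stability
    chol = np.linalg.cholesky(K)
    nonlinear = (chol @ rng.normal(size=(n_samples, n_taxa))).T

    # Latent log-abundances: additive components plus small Gaussian noise.
    eta = linear + nonlinear + 0.1 * rng.normal(size=(n_taxa, n_samples))

    # Softmax maps each column of eta to a composition (assumed link).
    e = np.exp(eta - eta.max(axis=0, keepdims=True))
    pi = e / e.sum(axis=0, keepdims=True)

    # Multinomial counts with a fixed sequencing depth per sample.
    counts = np.stack(
        [rng.multinomial(depth, pi[:, j]) for j in range(n_samples)], axis=1
    )
    return counts, pi, linear, nonlinear


counts, pi, linear, nonlinear = simulate_additive_mln()
print(counts.shape)
```

In this sketch, deconvolution would mean recovering `linear` and `nonlinear` separately from `counts` alone; the simulation only generates data of that shape and performs no inference.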
- Publication: arXiv e-prints
- Pub Date: October 2024
- DOI: 10.48550/arXiv.2410.03911
- arXiv: arXiv:2410.03911
- Bibcode: 2024arXiv241003911C
- Keywords: Statistics - Methodology; Statistics - Applications