Bayesian compositional regression with microbiome features via variational inference

Publication date

2023-12

Authors

Scott, Darren A.V.
Benavente, Ernest DiezORCID 0000-0002-4313-4290
Libiseller-Egger, Julian
Fedorov, Dmitry
Phelan, Jody
Ilina, Elena
Tikhonova, Polina
Kudryavstev, Alexander
Galeeva, Julia
Clark, Taane

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by

Abstract

The microbiome plays a key role in the health of the human body. Interest often lies in finding features of the microbiome, alongside other covariates, which are associated with a phenotype of interest. One important property of microbiome data, which is often overlooked, is its compositionality as it can only provide information about the relative abundance of its constituting components. Typically, these proportions vary by several orders of magnitude in datasets of high dimensions. To address these challenges we develop a Bayesian hierarchical linear log-contrast model which is estimated by mean field Monte-Carlo co-ordinate ascent variational inference (CAVI-MC) and easily scales to high dimensional data. We use novel priors which account for the large differences in scale and constrained parameter space associated with the compositional covariates. A reversible jump Monte Carlo Markov chain guided by the data through univariate approximations of the variational posterior probability of inclusion, with proposal parameters informed by approximating variational densities via auxiliary parameters, is used to estimate intractable marginal expectations. We demonstrate that our proposed Bayesian method performs favourably against existing frequentist state of the art compositional data analysis methods. We then apply the CAVI-MC to the analysis of real data exploring the relationship of the gut microbiome to body mass index.

Keywords

Compositional, Markov chain Monte Carlo, Microbiome, Singular multivariate normal, Variational inference, Structural Biology, Biochemistry, Molecular Biology, Computer Science Applications, Applied Mathematics

Citation

Scott, D A V, Benavente, E, Libiseller-Egger, J, Fedorov, D, Phelan, J, Ilina, E, Tikhonova, P, Kudryavstev, A, Galeeva, J, Clark, T & Lewin, A 2023, 'Bayesian compositional regression with microbiome features via variational inference', BMC Bioinformatics, vol. 24, no. 1, 210. https://doi.org/10.1186/s12859-023-05219-x