Structural zeros in high-dimensional data with applications to microbiome studies

17Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

This paper is motivated by the recent interest in the analysis of high-dimensional microbiome data. A key feature of these data is the presence of "structural zeros" which are microbes missing from an observation vector due to an underlying biological process and not due to error in measurement. Typical notions of missingness are unable to model these structural zeros. We define a general framework which allows for structural zeros in the model and propose methods of estimating sparse high-dimensional covariance and precision matrices under this setup. We establish error bounds in the spectral and Frobenius norms for the proposed estimators and empirically verify them with a simulation study. The proposed methodology is illustrated by applying it to the global gut microbiome data of Yatsunenko and others (2012. Human gut microbiome viewed across age and geography. Nature 486, 222-227). Using our methodology we classify subjects according to the geographical location on the basis of their gut microbiome.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Kaul, A., Davidov, O., & Peddada, S. D. (2017). Structural zeros in high-dimensional data with applications to microbiome studies. Biostatistics, 18(3), 422–433. https://doi.org/10.1093/biostatistics/kxw053

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 27

84%

Researcher 4

13%

Professor / Associate Prof. 1

3%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 8

36%

Mathematics 8

36%

Medicine and Dentistry 3

14%

Immunology and Microbiology 3

14%

Save time finding and organizing research with Mendeley

Sign up for free