Statistical properties of multivariate distance matrix regression for high-dimensional data analysis


Abstract

Multivariate distance matrix regression (MDMR) analysis is a statistical technique that allows researchers to relate P variables to an additional M factors collected on N individuals, where P ≫ N. The technique can be applied to a number of research settings involving high-dimensional data types such as DNA sequence data, gene expression microarray data, and imaging data. MDMR analysis involves computing the distance between all pairs of individuals with respect to P variables of interest and constructing an N × N matrix whose elements reflect these distances. Permutation tests can be used to test linear hypotheses that consider whether or not the M additional factors collected on the individuals can explain variation in the observed distances between and among the N individuals as reflected in the matrix. Despite its appeal and utility, properties of the statistics used in MDMR analysis have not been explored in detail. In this paper we consider the level accuracy and power of MDMR analysis assuming different distance measures and analysis settings. We also describe the utility of MDMR analysis in assessing hypotheses about the appropriate number of clusters arising from a cluster analysis. © 2012 Zapala and Schork.
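
The procedure outlined in the abstract (pairwise distances, an N × N matrix, a permutation test on M factors) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: it assumes the commonly used pseudo-F formulation, in which the squared distances are Gower-centered into a matrix G and compared against the hat matrix H of the factors, and it assumes the factor matrix has full column rank. The function names (`gower_center`, `pseudo_f`, `mdmr_permutation_test`) and the defaults for `n_perm` and `seed` are hypothetical choices made for this example.

```python
import numpy as np


def gower_center(D):
    """Gower-center the squared distances: G = C (-0.5 * D**2) C."""
    n = D.shape[0]
    A = -0.5 * D ** 2
    C = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return C @ A @ C


def pseudo_f(G, X):
    """Pseudo-F statistic for the factors X (N x M) given the centered matrix G."""
    n = G.shape[0]
    X1 = np.column_stack([np.ones(n), X])  # add an intercept column
    H = X1 @ np.linalg.pinv(X1)            # hat (projection) matrix
    R = np.eye(n) - H                      # residual projection
    m = X1.shape[1] - 1                    # number of factors (assumes full rank)
    explained = np.trace(H @ G @ H) / m
    residual = np.trace(R @ G @ R) / (n - m - 1)
    return explained / residual


def mdmr_permutation_test(D, X, n_perm=999, seed=0):
    """Return (observed pseudo-F, permutation p-value) for distance matrix D."""
    rng = np.random.default_rng(seed)
    D = np.asarray(D, dtype=float)
    X = np.asarray(X, dtype=float).reshape(D.shape[0], -1)
    G = gower_center(D)
    f_obs = pseudo_f(G, X)
    exceed = 0
    for _ in range(n_perm):
        # Permuting rows and columns of G together relabels the individuals,
        # which breaks any association between the distances and the factors.
        idx = rng.permutation(D.shape[0])
        if pseudo_f(G[np.ix_(idx, idx)], X) >= f_obs:
            exceed += 1
    return f_obs, (exceed + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Simulated example with P >> N: 30 individuals, 200 variables, one factor.
    rng = np.random.default_rng(42)
    group = np.repeat([0.0, 1.0], 15)
    Y = rng.normal(size=(30, 200)) + 0.5 * group[:, None]
    # Euclidean distances between all pairs of individuals.
    D = np.sqrt(((Y[:, None, :] - Y[None, :, :]) ** 2).sum(axis=-1))
    f_stat, p_value = mdmr_permutation_test(D, group, n_perm=499)
    print(f"pseudo-F = {f_stat:.3f}, permutation p = {p_value:.3f}")
```

Euclidean distance is used here only for convenience; any other distance measure can be substituted by changing how `D` is built. With Euclidean distance on a single variable, this pseudo-F reduces to the ordinary regression F statistic, which is a useful sanity check.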

Cite


Zapala, M. A., & Schork, N. J. (2012). Statistical properties of multivariate distance matrix regression for high-dimensional data analysis. Frontiers in Genetics, 3(SEP). https://doi.org/10.3389/fgene.2012.00190
