Multiple Imputation and Random Forests (MIRF) for unobservable, high-dimensional data

8Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

Understanding the genetic underpinnings to complex diseases requires consideration of sophisticated analytical methods designed to uncover intricate associations across multiple predictor variables. At the same time, knowledge of whether single nucleotide polymorphisms within a gene are on the same (in cis) or on different (in trans) chromosomal copies, may provide crucial information about measures of disease progression. In association studies of unrelated individuals, allelic phase is generally unobservable, generating an additional analytical challenge. In this manuscript, we describe a novel approach that combines multiple imputation and random forests for this high-dimensional, unobservable data setting. An application to a cohort of IHV-1 infected individuals receiving anti-retroviral therapies is presented. A simulation study is also presented to characterize method performance. Copyright © 2007 The Berkeley Electronic Press. All rights reserved.

Cite

CITATION STYLE

APA

Nonyane, B. A. S., & Foulkes, A. S. (2007). Multiple Imputation and Random Forests (MIRF) for unobservable, high-dimensional data. International Journal of Biostatistics, 3(1). https://doi.org/10.2202/1557-4679.1049

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free