Evaluation of single-nucleotide polymorphism imputation using random forests

  • Schwarz D
  • Szymczak S
  • Ziegler A
  • et al.
N/ACitations
Citations of this article
25Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Genome-wide association studies (GWAS) have helped to reveal genetic mechanisms of complex diseases. Although commonly used genotyping technology enables us to determine up to a million single-nucleotide polymorphisms (SNPs), causative variants are typically not genotyped directly. A favored approach to increase the power of genome-wide association studies is to impute the untyped SNPs using more complete genotype data of a reference population.Random forests (RF) provides an internal method for replacing missing genotypes. A forest of classification trees is used to determine similarities of probands regarding their genotypes. These proximities are then used to impute genotypes of untyped SNPs.We evaluated this approach using genotype data of the Framingham Heart Study provided as Problem 2 for Genetic Analysis Workshop 16 and the Caucasian HapMap samples as reference population. Our results indicate that RFs are faster but less accurate than alternative approaches for imputing untyped SNPs.

Cite

CITATION STYLE

APA

Schwarz, D. F., Szymczak, S., Ziegler, A., & König, I. R. (2009). Evaluation of single-nucleotide polymorphism imputation using random forests. BMC Proceedings, 3(S7). https://doi.org/10.1186/1753-6561-3-s7-s65

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free