Parallel classification and feature selection in microarray data using SPRINT

Lawrence Mitchell; Terence M. Sloan; Muriel Mewissen; Peter Ghazal; Thorsten Forster; Michal Piotrowski; Arthur Trew

Journal Article (Open Access)

Concurrency and Computation: Practice and Experience (2014) 26(4) 854-865

DOI: 10.1002/cpe.2928

Abstract

The statistical language R is favoured by many biostatisticians for processing microarray data. The quantity of data that experiments can produce has recently risen significantly, so analyses that used to be fast have become time-consuming, or even impossible, with the existing software infrastructure. High performance computing (HPC) systems offer a solution to these problems, but at the expense of increased complexity for the end user. The Simple Parallel R Interface (SPRINT) is a library for R that aims to reduce the complexity of using HPC systems by providing biostatisticians with drop-in parallelised replacements for existing R functions. In this paper, we describe parallel implementations of two popular techniques: exploratory clustering analyses using the random forest classifier, and feature selection through identification of differentially expressed genes using the rank product method. Copyright © 2012 John Wiley & Sons, Ltd.
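For readers unfamiliar with the rank product method named in the abstract, the following is a minimal serial sketch of the statistic itself, written in Python for illustration. It is not SPRINT's implementation and does not use SPRINT's API; the function name `rank_products` and the toy data are hypothetical. It assumes one log fold change per gene per replicate, ranks genes within each replicate (rank 1 is the most up-regulated), and takes the geometric mean of each gene's ranks.

```python
import math


def rank_products(fold_changes):
    """Return the rank product of each gene.

    fold_changes[i][g] is the log fold change of gene g in replicate i
    (hypothetical input layout, chosen for this sketch). A gene's rank
    product is the geometric mean of its ranks across replicates, where
    rank 1 is the largest fold change within a replicate.
    """
    n_reps = len(fold_changes)
    n_genes = len(fold_changes[0])
    log_rank_sum = [0.0] * n_genes
    for replicate in fold_changes:
        # Order gene indices from most to least up-regulated.
        order = sorted(range(n_genes), key=lambda g: replicate[g], reverse=True)
        for rank, gene in enumerate(order, start=1):
            # Summing logs avoids overflow when multiplying many ranks.
            log_rank_sum[gene] += math.log(rank)
    return [math.exp(total / n_reps) for total in log_rank_sum]


# Toy data: 3 replicates, 4 genes (values are made up for illustration).
data = [
    [2.0, 0.1, -1.0, 0.5],
    [1.5, 0.3, -0.8, 0.2],
    [1.8, -0.2, -1.2, 0.4],
]
rp = rank_products(data)
```

A small rank product marks a gene that ranks consistently near the top across replicates. In practice, significance is usually assessed by recomputing the statistic on many permuted data sets; those permutations are independent of one another, which is one reason the method lends itself to parallel execution.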

Cite

Mitchell, L., Sloan, T. M., Mewissen, M., Ghazal, P., Forster, T., Piotrowski, M., & Trew, A. (2014). Parallel classification and feature selection in microarray data using SPRINT. Concurrency and Computation: Practice and Experience, 26(4), 854–865. https://doi.org/10.1002/cpe.2928
