Abstract
Motivation: Microarray experiments generate large datasets with expression values for thousands of genes but not more than a few dozens of samples. Accurate supervised classification of tissue samples in such high-dimensional problems is difficult but often crucial for successful diagnosis and treatment. A promising way to meet this challenge is by using boosting in conjunction with decision trees. Results: We demonstrate that the generic boosting algorithm needs some modification to become an accurate classifier in the context of gene expression data. In particular, we present a feature preselection method, a more robust boosting procedure and a new approach for multicategorical problems. This allows for slight to drastic increase in performance and yields competitive results on several publicly available datasets.
Cite
CITATION STYLE
Dettling, M., & Bühlmann, P. (2003). Boosting for tumor classification with gene expression data. Bioinformatics, 19(9), 1061–1069. https://doi.org/10.1093/bioinformatics/btf867
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.