To emphasize gene interactions in the classification algorithms, a new representation is proposed, comprising gene-pairs and not single genes. Each pair is represented by L1 difference in the corresponding expression values. The novel representation is evaluated on benchmark datasets and is shown to often increase classification accuracy for genetic datasets. Exploiting the gene-pair representation and the Gene Ontology (GO), the semantic similarity of gene pairs can be incorporated to pre-select pairs with a high similarity value. The GO-based feature selection approach is compared to the plain data driven selection and is shown to often increase classification accuracy. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Schön, T., Tsymbal, A., & Huber, M. (2010). Gene-pair representation and incorporation of GO-based semantic similarity into classification of gene expression data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6086 LNAI, pp. 217–226). https://doi.org/10.1007/978-3-642-13529-3_24
Mendeley helps you to discover research relevant for your work.