Abstract
In this paper we present a new approach to feature selection for sequence data. We identify general feature categories and give construction algorithms for each of them. We show how they can be integrated in a system that tightly couples feature construction and feature selection. This integrated process, which we refer to as, feature generation, allows us to systematically search a large space of potential features. We demonstrate the effectiveness of our approach for an important component of the gene finding problem, splice-site prediction. We show that predictive models built using our feature generation algorithm achieve a significant improvement in accuracy over existing, state-of-the-art approaches. © Springer-Verlag Berlin Heidelberg 2006.
Cite
CITATION STYLE
Islamaj, R., Getoor, L., & Wilbur, W. J. (2006). A feature generation algorithm for sequences with application to splice-site prediction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4213 LNAI, pp. 553–560). Springer Verlag. https://doi.org/10.1007/11871637_55
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.