Data mining algorithms for virtual screening of bioactive compounds

3Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this chapter we study the problem of classifying chemical compound datasets. We present a sub-structure-based classification algorithm that decouples the sub-structure discovery process from the classification model construction and uses frequent subgraph discovery algorithms to find all topological and geometric sub-structures present in the dataset. The advantage of this approach is that during classification model construction, all relevant sub-structures are available allowing the classifier to intelligently select the most discriminating ones. The computational scalability is ensured by the use of highly efficient frequent subgraph discovery algorithms coupled with aggressive feature selection. Experimental evaluation on eight different classification problems shows that our approach is computationally scalable and on the average, outperforms existing schemes by 10% to 35%.

Cite

CITATION STYLE

APA

Deshpande, M., Kuramochi, M., & Karypis, G. (2007). Data mining algorithms for virtual screening of bioactive compounds. In Springer Optimization and Its Applications (Vol. 7, pp. 59–86). Springer International Publishing. https://doi.org/10.1007/978-0-387-69319-4_5

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free