Prediction of aqueous solubility and partition coefficient optimized by a genetic algorithm based descriptor selection method

106Citations
Citations of this article
43Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The paper describes a fast and flexible descriptor selection method using a genetic algorithm variant (GASEC). The relevance of the descriptors will be measured using Shannon entropy (SE) and differential Shannon entropy (DSE), which have very sparse memory requirements and allow the processing of huge data sets. A small quantity of the most important descriptors will be used automatically to build a value prediction model. The most important descriptors are not a linear combination of other descriptors, but transparent, pure descriptors. We used an artificial neural network (ANN) model to predict the aqueous solubility logS and the octanol/water partition coefficient logP. The logS data set was divided into a training set of 1016 compounds and a test set of 253 compounds. A correlation coefficient of 0.93 and an empirical standard deviation of 0.54 were achieved. The logP data set was divided into a training set of 1853 compounds and a test set of 138 compounds. A correlation coefficient of 0.92 and an empirical standard deviation of 0.44 were achieved.

Cite

CITATION STYLE

APA

Wegner, J. K., & Zell, A. (2003). Prediction of aqueous solubility and partition coefficient optimized by a genetic algorithm based descriptor selection method. Journal of Chemical Information and Computer Sciences, 43(3), 1077–1084. https://doi.org/10.1021/ci034006u

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free