Chemoinformatic Classification Methods and their Applicability Domain

Miriam Mathea; Waldemar Klingspohn; Knut Baumann

ArticleOPEN ACCESS

Chemoinformatic Classification Methods and their Applicability Domain

Molecular Informatics

DOI: 10.1002/minf.201501019

111Citations

132Readers

Abstract

Classification rules are often used in chemoinformatics to predict categorical properties of drug candidates related to bioactivity from explanatory variables, which encode the respective molecular structures (i.e. molecular descriptors). To avoid predictions with an unduly large error probability, the domain the classifier is applied to should be restricted to the domain covered by the training set objects. This latter domain is commonly referred to as applicability domain in chemoinformatics. Conceptually, the applicability domain defines the region in space where the "normal" objects are located. Defining the border of the applicability domain may then be viewed as detecting anomalous or novel objects or as detecting outliers. Currently two different types of measures are in use. The first one defines the applicability domain solely in terms of the molecular descriptor space, which is referred to as novelty detection. The second type defines the applicability domain in terms of the expected reliability of the predictions which is referred to as confidence estimation. Both types are systematically differentiated here and the most popular measures are reviewed. It will be shown that all common chemoinformatic classifiers have built-in confidence scores. Since confidence estimation uses information of the class labels for computing the confidence scores, it is expected to be more efficient in reducing the error rate than novelty detection, which solely uses the information of the explanatory variables.

Author supplied keywords

Cite

CITATION STYLE

APA

Mathea, M., Klingspohn, W., & Baumann, K. (2016, May 1). Chemoinformatic Classification Methods and their Applicability Domain. Molecular Informatics. Wiley-VCH Verlag. https://doi.org/10.1002/minf.201501019

Chemoinformatic Classification Methods and their Applicability Domain

Abstract

Author supplied keywords

Cite

Register to see more suggestions