Abstract
Purpose: Automatic anatomical therapeutic chemical (ATC) classification is progressing at a rapid pace because of its potential in drug development. Predicting an unknown compound's therapeutic and chemical characteristics in terms of how it affects multiple organs and physiological systems makes automatic ATC classification a vital yet challenging multilabel problem. The aim of this paper is to experimentally derive an ensemble of different feature descriptors and classifiers for ATC classification that outperforms the state-of-the-art. Design/methodology/approach: The proposed method is an ensemble generated by the fusion of neural networks (i.e. a tabular model and long short-term memory networks (LSTM)) and multilabel classifiers based on multiple linear regression (hMuLab). All classifiers are trained on three sets of descriptors. Features extracted from the trained LSTMs are also fed into hMuLab. Evaluations of ensembles are compared on a benchmark data set of 3883 ATC-coded pharmaceuticals taken from KEGG, a publicly available drug databank. Findings: Experiments demonstrate the power of the authors’ best ensemble, EnsATC, which is shown to outperform the best methods reported in the literature, including the state-of-the-art developed by the fast.ai research group. The MATLAB source code of the authors’ system is freely available to the public at https://github.com/LorisNanni/Neural-networks-for-anatomical-therapeutic-chemical-ATC-classification. Originality/value: This study demonstrates the power of extracting LSTM features and combining them with ATC descriptors in ensembles for ATC classification.
Author supplied keywords
Cite
CITATION STYLE
Nanni, L., Lumini, A., & Brahnam, S. (2022). Neural networks for anatomical therapeutic chemical (ATC) classification. Applied Computing and Informatics. https://doi.org/10.1108/ACI-11-2021-0301
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.