Abstract
Background: The Anatomical Therapeutic Chemicals (ATC) classification system is a widely accepted drug classification system. It classifies drugs according to the organ or system in which they can operate and their therapeutic, pharmacological, and chemical properties. Assigning drugs into 14 classes in the first level of the system is an essential step to understanding drug properties. Several multi-label classifiers have been proposed to identify drug classes. Although their performance was good, most classifiers directly only adopted drug relationships or the features derived from these relationships, but the essential properties of drugs were not directly employed. Thus, classifiers still have a space for improvement.
Objective: The aim of this study was to build a novel and powerful multilabel classifier for identifying classes in the first level of the ATC classification system for given drugs.
Methods: A powerful multi-label classifier, namely, iATC-NFMLP, was proposed. Two feature types were adopted to encode each drug. The first type was derived from drug relationships via a network embedding algorithm, whereas the second one represented the fingerprints of drugs. Multilayer perceptron using sigmoid as the activating function was used to learn these features for the construction of the classifier.
Results: The 10-fold cross-validation results indicated that a combination of the two feature types could improve the performance of the classifier. The jackknife test on the benchmark dataset with 3883 drugs showed that the accuracy and absolute true were 82.76% and 79.27%, respectively.
Conclusion: The performance of iATC-NFMLP was best compared with all previous classifiers.
Keywords: Drug, ATC classification system, multi-label classification, network embedding algorithm, fingerprint, multilayer perceptron.
Graphical Abstract
[http://dx.doi.org/10.1016/j.csbj.2018.11.007] [PMID: 30595814]
[PMID: 34161205]
[http://dx.doi.org/10.2174/1568026619666191203113745] [PMID: 31797761]
[http://dx.doi.org/10.2174/1389450119666180809122244] [PMID: 30091413]
[http://dx.doi.org/10.1007/s11030-017-9732-0] [PMID: 28275924]
[http://dx.doi.org/10.1155/2017/4649191] [PMID: 28630865]
[http://dx.doi.org/10.1371/journal.pone.0035254] [PMID: 22514724]
[http://dx.doi.org/10.1039/c3mb70490d] [PMID: 24492783]
[http://dx.doi.org/10.1093/bioinformatics/btx387] [PMID: 28172617]
[http://dx.doi.org/10.18632/oncotarget.17028] [PMID: 28938573]
[http://dx.doi.org/10.1093/bioinformatics/btx278] [PMID: 28444139]
[PMID: 31593226]
[http://dx.doi.org/10.1093/bioinformatics/btaa166] [PMID: 32154836]
[http://dx.doi.org/10.3389/fphar.2019.00971] [PMID: 31543820]
[http://dx.doi.org/10.4236/abb.2020.115012]
[http://dx.doi.org/10.2174/1381612824666181112113438] [PMID: 30417778]
[http://dx.doi.org/10.1093/bioinformatics/btab204] [PMID: 33769479]
[http://dx.doi.org/10.1186/s12859-017-1660-6] [PMID: 28617230]
[http://dx.doi.org/10.1093/bioinformatics/btv055] [PMID: 25638810]
[http://dx.doi.org/10.1093/bioinformatics/btt158] [PMID: 23564845]
[http://dx.doi.org/10.1016/j.jbi.2015.09.016] [PMID: 26434987]
[http://dx.doi.org/10.1093/nar/gkt1207] [PMID: 24293645]
[http://dx.doi.org/10.1093/nar/gkw1092] [PMID: 27899662]
[http://dx.doi.org/10.1093/nar/gkq367] [PMID: 20460463]
[http://dx.doi.org/10.1016/j.cels.2016.10.017] [PMID: 27889536]
[http://dx.doi.org/10.1016/j.ajhg.2008.02.013] [PMID: 18371930]
[http://dx.doi.org/10.1109/ICDM.2006.70]
[http://dx.doi.org/10.1186/s12859-015-0774-y] [PMID: 26537615]
[http://dx.doi.org/10.1016/j.neucom.2018.10.028]
[http://dx.doi.org/10.1109/JBHI.2018.2883834] [PMID: 30507518]
[http://dx.doi.org/10.1021/ci00057a005]
[http://dx.doi.org/10.4018/jdwm.2007070101]
[http://dx.doi.org/10.1016/0920-5489(94)90017-5]
[http://dx.doi.org/10.1186/s12859-017-1898-z] [PMID: 29297288]
[http://dx.doi.org/10.1145/2951913.2976746]
[http://dx.doi.org/10.1007/s11704-017-7031-7]
[http://dx.doi.org/10.1007/BF00994018]
[http://dx.doi.org/10.1023/A:1010933404324]
[http://dx.doi.org/10.1093/bib/bbz041] [PMID: 31067315]
[http://dx.doi.org/10.3389/fcell.2020.627302] [PMID: 33505977]
[http://dx.doi.org/10.1093/bioinformatics/btu852] [PMID: 25568279]
[http://dx.doi.org/10.1093/bib/bby028] [PMID: 29897410]
[http://dx.doi.org/10.1109/ACCESS.2020.3009439]
[http://dx.doi.org/10.1016/j.mbs.2018.09.010] [PMID: 30296417]
[http://dx.doi.org/10.1155/2012/837245] [PMID: 22701510]
[http://dx.doi.org/10.1162/NECO_a_00571] [PMID: 24479776]