Abstract
Background: Virtual screening of candidate drug molecules using machine learning techniques plays a key role in pharmaceutical industry to design and discovery of new drugs. Computational classification methods can determine drug types according to the disease groups and distinguish approved drugs from withdrawn ones.
Introduction: Classification models developed in this study can be used as a simple filter in drug modelling to eliminate potentially inappropriate molecules in the early stages. In this work, we developed a Drug Decision Support System (DDSS) to classify each drug candidate molecule as potentially drug or non-drug and to predict its disease group.
Methods: Molecular descriptors were identified for the determination of a number of rules in drug molecules. They were derived using ADRIANA.Code program and Lipinski's rule of five. We used Artificial Neural Network (ANN) to classify drug molecules correctly according to the types of diseases. Closed frequent molecular structures in the form of subgraph fragments were also obtained with Gaston algorithm included in ParMol Package to find common molecular fragments for withdrawn drugs.
Results: We observed that TPSA, XlogP Natoms, HDon_O and TPSA are the most distinctive features in the pool of the molecular descriptors and evaluated the performances of classifiers on all datasets and found that classification accuracies are very high on all the datasets. Neural network models achieved 84.6% and 83.3% accuracies on test sets including cardiac therapy, anti-epileptics and anti-parkinson drugs with approved and withdrawn drugs for drug classification problems.
Conclusion: The experimental evaluation shows that the system is promising at determination of potential drug molecules to classify drug molecules correctly according to the types of diseases.
Keywords: Drug design, molecular descriptors, artificial neural network, ADRIANA.Code, data mining, frequent subgraph mining.
Graphical Abstract
[http://dx.doi.org/10.1016/j.tips.2013.08.004] [PMID: 24035675]
[http://dx.doi.org/10.1016/j.copbio.2007.10.005] [PMID: 18035532]
[http://dx.doi.org/10.1016/j.drudis.2014.10.012] [PMID: 25448759]
[http://dx.doi.org/10.1126/science.287.5460.1960] [PMID: 10720314]
[http://dx.doi.org/10.1007/978-0-387-69094-0]
[http://dx.doi.org/10.2174/138920307781369427] [PMID: 17696867]
[http://dx.doi.org/10.1016/S1359-6446(99)01451-8] [PMID: 10652455]
[http://dx.doi.org/10.1016/j.drudis.2013.02.008] [PMID: 23458995]
[http://dx.doi.org/10.2174/09298673113209990001] [PMID: 23651302]
[http://dx.doi.org/10.1016/j.addr.2006.09.005] [PMID: 17081647]
[http://dx.doi.org/10.1017/S0033583515000190] [PMID: 26328949]
[http://dx.doi.org/10.1021/ci7002076] [PMID: 17929911]
[http://dx.doi.org/10.1016/j.drudis.2019.03.015] [PMID: 30890362]
[http://dx.doi.org/10.1016/j.drudis.2014.01.005] [PMID: 24451293]
[http://dx.doi.org/10.1016/S0169-409X(00)00129-0] [PMID: 11259830]
[http://dx.doi.org/10.1016/S0169-409X(00)00129-0] [PMID: 11259830]
[http://dx.doi.org/10.1016/S1056-8719(00)00107-6] [PMID: 11274893]
[http://dx.doi.org/10.1016/j.eswa.2012.02.063]
[http://dx.doi.org/10.1007/s10822-006-9096-5] [PMID: 17205373]
[http://dx.doi.org/10.1016/j.drudis.2017.08.010] [PMID: 28881183]
[http://dx.doi.org/10.1016/j.bmcl.2018.06.046] [PMID: 30122222]
[http://dx.doi.org/10.1021/ci049875d] [PMID: 15446842]
[http://dx.doi.org/10.1016/j.patcog.2015.08.014]
[http://dx.doi.org/10.1016/j.eswa.2016.12.008]
[http://dx.doi.org/10.1021/ci049869h] [PMID: 15446820]
[http://dx.doi.org/10.1109/TSMCB.2007.895334] [PMID: 17702280]
[http://dx.doi.org/10.1021/ci010073h] [PMID: 11749582]
[http://dx.doi.org/10.1021/ci0001482] [PMID: 11277704]
[http://dx.doi.org/10.7861/clinmedicine.10-1-54] [PMID: 20408309]
[http://dx.doi.org/10.1016/j.maturitas.2013.05.019] [PMID: 23827471]
[http://dx.doi.org/10.1016/j.neulet.2009.12.025] [PMID: 20026245]
[http://dx.doi.org/10.1136/bmjopen-2013-004221] [PMID: 24435895]
[http://dx.doi.org/10.1093/nar/gkv1192] [PMID: 26553801]
[http://dx.doi.org/10.1016/j.drudis.2012.07.016] [PMID: 22889967]
[http://dx.doi.org/10.1109/TITB.2007.907985] [PMID: 18632325]
[http://dx.doi.org/10.1038/nchembio747] [PMID: 16370374]
[http://dx.doi.org/10.1016/S0097-8485(01)00094-8] [PMID: 11765851]
[http://dx.doi.org/10.1016/j.cmpb.2014.08.009] [PMID: 25224081]
[http://dx.doi.org/10.1016/j.asoc.2014.09.042]
[http://dx.doi.org/10.4051/ibc.2012.4.1.0002]
[http://dx.doi.org/10.1016/S0920-1211(98)00076-X] [PMID: 10022362]
[http://dx.doi.org/10.1021/ci900419k] [PMID: 20088575]
[http://dx.doi.org/10.1016/j.drudis.2018.05.010] [PMID: 29750902]
[http://dx.doi.org/10.1016/j.compbiolchem.2017.03.008] [PMID: 28391063]
[http://dx.doi.org/10.1002/wcms.1183] [PMID: 25285160]
[http://dx.doi.org/10.1002/0470015535]
[http://dx.doi.org/10.1016/j.drudis.2018.01.039] [PMID: 29366762]
[http://dx.doi.org/10.1016/j.eswa.2018.12.032]
[http://dx.doi.org/10.1145/1014052.1014134]
[http://dx.doi.org/10.1021/acs.chemrestox.5b00260] [PMID: 26360911]
[http://dx.doi.org/10.1186/1471-2164-12-S5-S11] [PMID: 22369493]
[http://dx.doi.org/10.1186/s12916-014-0262-7] [PMID: 25651859]
[http://dx.doi.org/10.1186/1471-2105-12-169] [PMID: 21586169]
[http://dx.doi.org/10.1021/ci050358k] [PMID: 16562981]
[http://dx.doi.org/10.1016/j.neucom.2015.08.054]
[http://dx.doi.org/10.1007/s002280100265] [PMID: 11372600]
[http://dx.doi.org/10.2165/00002018-200629020-00008] [PMID: 16454545]
[http://dx.doi.org/10.1002/pds.1248] [PMID: 16700082]
[http://dx.doi.org/10.1186/s12916-016-0553-2] [PMID: 26843061]
[http://dx.doi.org/10.1111/j.1365-2125.2005.02504.x] [PMID: 16305591]
[http://dx.doi.org/10.1016/j.bbalip.2010.07.007] [PMID: 20691280]