Abstract
Background: Machine learning is an active area of research in computer science by the availability of big data collection of all sorts prompting interest in the development of novel tools for data mining. Machine learning methods have wide applications in computer-aided drug discovery methods. Most incredible approaches to machine learning are used in drug designing, which further aid the process of biological modelling in drug discovery. Mainly, two main categories are present which are Ligand-Based Virtual Screening (LBVS) and Structure-Based Virtual Screening (SBVS), however, the machine learning approaches fall mostly in the category of LBVS.
Objectives: This study exposits the major machine learning approaches being used in LBVS. Moreover, we have introduced a protocol named FP-CADD which depicts a 4-steps rule of thumb for drug discovery, the four protocols of computer-aided drug discovery (FP-CADD). Various important aspects along with SWOT analysis of FP-CADD are also discussed in this article.
Conclusion: By this thorough study, we have observed that in LBVS algorithms, Support Vector Machines (SVM) and Random Forest (RF) are those which are widely used due to high accuracy and efficiency. These virtual screening approaches have the potential to revolutionize the drug designing field. Also, we believe that the process flow presented in this study, named FP-CADD, can streamline the whole process of computer-aided drug discovery. By adopting this rule, the studies related to drug discovery can be made homogeneous and this protocol can also be considered as an evaluation criterion in the peer-review process of research articles.
Keywords: Machine learning, computational modelling, drug discovery, drug designing, FP-CADD, computational biology.
Graphical Abstract
[http://dx.doi.org/10.3758/s13428-016-0788-z] [PMID: 27613019]
[http://dx.doi.org/10.1016/j.cbpa.2004.04.005] [PMID: 15183324]
[http://dx.doi.org/10.1016/j.drudis.2011.12.006] [PMID: 22178890]
[http://dx.doi.org/10.1016/j.bmc.2007.10.048] [PMID: 17996450]
[http://dx.doi.org/10.2174/09298673113209990001] [PMID: 23651302]
[http://dx.doi.org/10.1021/ci700461s] [PMID: 18318473]
[http://dx.doi.org/10.1021/ci900419k] [PMID: 20088575]
[http://dx.doi.org/10.1038/nrd1156] [PMID: 12904816]
[http://dx.doi.org/10.1016/j.drudis.2007.01.011] [PMID: 17331887]
[http://dx.doi.org/10.1038/nrd941] [PMID: 12415248]
[http://dx.doi.org/10.1021/ci700087v] [PMID: 17622131]
[http://dx.doi.org/10.1016/j.drudis.2006.10.005] [PMID: 17129822]
[http://dx.doi.org/10.1021/jm0603365] [PMID: 17201411]
[http://dx.doi.org/10.1186/s13321-018-0302-y] [PMID: 30288626]
[http://dx.doi.org/10.1002/wcms.1183] [PMID: 25285160]
[http://dx.doi.org/10.1021/ci0340916] [PMID: 14632457]
[http://dx.doi.org/10.1021/ci049641u] [PMID: 15921445]
[http://dx.doi.org/10.1021/ci100073w] [PMID: 21207929]
[http://dx.doi.org/10.1016/j.rse.2006.04.001]
[http://dx.doi.org/10.1021/ci9003865] [PMID: 20387860]
[http://dx.doi.org/10.1007/s10994-015-5517-9]
[http://dx.doi.org/10.1021/ci060138m] [PMID: 16995731]
[http://dx.doi.org/10.1021/ci050367t] [PMID: 16562992]
[http://dx.doi.org/10.1006/jmps.1999.1278] [PMID: 10733859]
[http://dx.doi.org/10.1021/ci9002624] [PMID: 19780576]
[http://dx.doi.org/10.1007/s00894-010-0854-x] [PMID: 20857153]
[http://dx.doi.org/10.1021/ci020058s] [PMID: 12653517]
[http://dx.doi.org/10.1021/ci700351y] [PMID: 18269264]
[http://dx.doi.org/10.1007/s10994-014-5451-2] [PMID: 28919667]
[http://dx.doi.org/10.1007/BF00058655]
[http://dx.doi.org/10.1093/bioinformatics/btq112] [PMID: 20236947]
[http://dx.doi.org/10.1093/protein/6.7.723] [PMID: 7504257]
[http://dx.doi.org/10.1016/j.bbrc.2005.06.087] [PMID: 16002049]
[http://dx.doi.org/10.1016/j.drudis.2014.10.012] [PMID: 25448759]
[http://dx.doi.org/10.1021/ci049628+] [PMID: 15921467]
[http://dx.doi.org/10.1021/ci050358k] [PMID: 16562981]
[http://dx.doi.org/10.1021/ci800079x] [PMID: 19055411]
[http://dx.doi.org/10.1021/ci100090p] [PMID: 20504032]
[http://dx.doi.org/10.1021/ci2004994] [PMID: 22098146]
[http://dx.doi.org/10.1177/146045829500100206]
[http://dx.doi.org/10.2174/157488407781668811] [PMID: 18690868]
[http://dx.doi.org/10.1093/bib/bbn021] [PMID: 18460474]
[http://dx.doi.org/10.1124/dmd.108.023507] [PMID: 19056915]
[http://dx.doi.org/10.1371/journal.pone.0015079] [PMID: 21152409]
[http://dx.doi.org/10.1093/nar/26.9.2230] [PMID: 9547285]
[http://dx.doi.org/10.1016/S0097-8485(01)00099-7] [PMID: 11765852]
[http://dx.doi.org/10.1016/j.bmc.2012.04.024] [PMID: 22595424]
[http://dx.doi.org/10.1021/ci0600657] [PMID: 17125175]
[http://dx.doi.org/10.1016/j.chemolab.2013.05.012]
[http://dx.doi.org/10.1016/j.cmpb.2014.08.009] [PMID: 25224081]
[http://dx.doi.org/10.1021/ci900337f] [PMID: 19961205]
[http://dx.doi.org/10.1016/j.drudis.2013.11.012] [PMID: 24269836]
[http://dx.doi.org/10.1074/mcp.R800006-MCP200] [PMID: 18676365]
[http://dx.doi.org/10.1016/S0958-1669(97)80126-7] [PMID: 9425663]
[http://dx.doi.org/10.1016/j.chembiol.2012.01.002] [PMID: 22284356]
[http://dx.doi.org/10.1016/S1359-6446(99)01306-9] [PMID: 10322265]
[http://dx.doi.org/10.1016/S1359-6446(05)03438-0] [PMID: 15894228]
[http://dx.doi.org/10.1371/journal.pone.0181966] [PMID: 28797096]
[http://dx.doi.org/10.1007/s00232-016-9937-7] [PMID: 27866233]
[http://dx.doi.org/10.1016/j.ab.2018.04.021] [PMID: 29704476]
[http://dx.doi.org/10.1080/07391102.2014.968875] [PMID: 25248923]
[http://dx.doi.org/10.1016/j.bbrc.2007.08.140] [PMID: 17880924]
[http://dx.doi.org/10.18632/oncotarget.10027] [PMID: 27322424]
[http://dx.doi.org/10.3390/ijms15057594] [PMID: 24857907]
[http://dx.doi.org/10.1016/j.jtbi.2016.01.020] [PMID: 26807806]
[http://dx.doi.org/10.18632/oncotarget.9987] [PMID: 27323404]
[http://dx.doi.org/10.1093/bioinformatics/btw380] [PMID: 27334473]
[http://dx.doi.org/10.1093/bioinformatics/btw387] [PMID: 27354696]
[http://dx.doi.org/10.1016/j.jtbi.2018.10.021] [PMID: 30312687]
[http://dx.doi.org/10.1093/bioinformatics/bty668] [PMID: 30052767]
[http://dx.doi.org/10.1016/j.chemolab.2018.11.012]
[PMID: 30520961]
[PMID: 29897410]
[http://dx.doi.org/10.1016/j.jtbi.2018.01.023] [PMID: 29408627]
[PMID: 30550813]
[http://dx.doi.org/10.1016/j.jtbi.2018.04.037] [PMID: 29727634]
[http://dx.doi.org/10.1007/s11033-018-4391-5] [PMID: 30238411]
[http://dx.doi.org/10.1038/s41598-018-19491-y] [PMID: 29348418]
[http://dx.doi.org/10.2174/1381612825666181127101039] [PMID: 30479209]
[http://dx.doi.org/10.1016/j.ab.2018.12.019] [PMID: 30593778]
[http://dx.doi.org/10.1016/j.jtbi.2019.02.007] [PMID: 30768975]
[PMID: 30550863]
[http://dx.doi.org/10.1007/s11033-018-4417-z] [PMID: 30311130]
[http://dx.doi.org/10.1007/s11224-020-01507-x]
[http://dx.doi.org/10.2174/1570163817666200312102659] [PMID: 32164512]
[PMID: 31838993]
[http://dx.doi.org/10.1038/nrd.2017.232] [PMID: 29242609]
[http://dx.doi.org/10.2174/1573409915666190130164923]
[http://dx.doi.org/10.4172/1948-5964.1000177]
[http://dx.doi.org/10.1007/s13337-018-0446-4] [PMID: 29911147]
[http://dx.doi.org/10.4103/0972-9062.217617] [PMID: 29097641]
[http://dx.doi.org/10.1016/j.jmgm.2017.12.011] [PMID: 29331879]
[http://dx.doi.org/10.1590/1678-4324-2018180004]
[http://dx.doi.org/10.2174/1570163816666190214161825]
[http://dx.doi.org/10.1093/nar/gkv951] [PMID: 26400175]
[http://dx.doi.org/10.1021/acs.jcim.5b00559] [PMID: 26479676]
[http://dx.doi.org/10.1007/s12551-016-0247-1] [PMID: 28510083]
[http://dx.doi.org/10.1016/j.cbi.2006.12.006] [PMID: 17229415]
[http://dx.doi.org/10.1038/aps.2012.109] [PMID: 22922346]
[http://dx.doi.org/10.1016/j.drudis.2009.01.003] [PMID: 19185058]
[http://dx.doi.org/10.1016/j.chemphys.2008.10.036]
[http://dx.doi.org/10.11144/Javeriana.SC24-3.artf]