Abstract
Peptides act as promising anticancer agents due to their ease of synthesis and modifications, enhanced tumor penetration, and less systemic toxicity. However, only limited success has been achieved so far, as experimental design and synthesis of anticancer peptides (ACPs) are prohibitively costly and time-consuming. Furthermore, the sequential increase in the protein sequence data via highthroughput sequencing makes it difficult to identify ACPs only through experimentation, which often involves months or years of speculation and failure. All these limitations could be overcome by applying machine learning (ML) approaches, which is a field of artificial intelligence that automates analytical model building for rapid and accurate outcome predictions. Recently, ML approaches hold great promise in the rapid discovery of ACPs, which could be witnessed by the growing number of MLbased anticancer prediction tools. In this review, we aim to provide a comprehensive view on the existing ML approaches for ACP predictions. Initially, we will briefly discuss the currently available ACP databases. This is followed by the main text, where state-of-the-art ML approaches working principles and their performances based on the ML algorithms are reviewed. Lastly, we discuss the limitations and future directions of the ML methods in the prediction of ACPs.
Keywords: Cancer, anticancer peptides, machine learning, support vector machine, random forest, ACPs.
Graphical Abstract
[http://dx.doi.org/10.1016/j.ejca.2013.09.002] [PMID: 24120180]
[http://dx.doi.org/10.1155/2012/967347] [PMID: 23316341]
[PMID: 27593958]
[http://dx.doi.org/10.3390/ijms20081964] [PMID: 31013619]
[http://dx.doi.org/10.18632/oncotarget.20365] [PMID: 29100375]
[PMID: 27016393]
[http://dx.doi.org/10.1371/journal.pone.0181748] [PMID: 28759605]
[http://dx.doi.org/10.3389/fphar.2018.00954] [PMID: 30210341]
[http://dx.doi.org/10.1016/bs.apcsb.2018.01.006] [PMID: 29680238]
[http://dx.doi.org/10.1016/j.drudis.2009.10.009] [PMID: 19879957]
[http://dx.doi.org/10.1002/med.20252] [PMID: 21922503]
[http://dx.doi.org/10.2174/092986712801661004] [PMID: 22725698]
[http://dx.doi.org/10.1021/mp300167e] [PMID: 22591113]
[http://dx.doi.org/10.1093/nar/gku892] [PMID: 25270878]
[http://dx.doi.org/10.3389/fmicb.2013.00294] [PMID: 24101917]
[http://dx.doi.org/10.3389/fchem.2017.00005] [PMID: 28271058]
[http://dx.doi.org/10.18632/oncotarget.7815] [PMID: 26942877]
[http://dx.doi.org/10.1038/srep33910] [PMID: 27670968]
[http://dx.doi.org/10.1038/srep02984] [PMID: 24136089]
[http://dx.doi.org/10.1007/s10989-014-9435-7]
[http://dx.doi.org/10.1093/bioinformatics/bty451] [PMID: 29868903]
[http://dx.doi.org/10.3390/genes9030158] [PMID: 29534013]
[http://dx.doi.org/10.1371/journal.pcbi.1004074] [PMID: 25849257]
[http://dx.doi.org/10.1093/nar/gkv1278] [PMID: 26602694]
[http://dx.doi.org/10.1093/bioinformatics/bts141] [PMID: 22467909]
[http://dx.doi.org/10.1111/1574-6968.12489] [PMID: 24888447]
[http://dx.doi.org/10.1038/srep24482] [PMID: 27075512]
[http://dx.doi.org/10.1371/journal.pone.0066557] [PMID: 23825543]
[PMID: 27777222]
[PMID: 29897410]
[PMID: 30388198]
[http://dx.doi.org/10.1093/bioinformatics/bty155] [PMID: 29547915]
[PMID: 30590442]
[http://dx.doi.org/10.1093/bioinformatics/btz358] [PMID: 31077296]
[http://dx.doi.org/10.1093/bioinformatics/btz408] [PMID: 31099381]
[http://dx.doi.org/10.1016/j.jtbi.2013.08.037] [PMID: 24035842]
[http://dx.doi.org/10.1016/j.artmed.2017.06.008] [PMID: 28655440]
[http://dx.doi.org/10.1093/bioinformatics/btz246] [PMID: 30994882]
[http://dx.doi.org/10.1007/BF00994018]
[http://dx.doi.org/10.1093/bib/bbz041] [PMID: 31067315]
[http://dx.doi.org/10.1093/bioinformatics/btz432] [PMID: 31135038]
[http://dx.doi.org/10.4155/fmc-2017-0300] [PMID: 30039980]
[http://dx.doi.org/10.1016/j.compbiolchem.2019.05.006] [PMID: 31146118]
[http://dx.doi.org/10.1016/j.compbiolchem.2019.05.008] [PMID: 31151025]
[http://dx.doi.org/10.3389/fmicb.2018.00476] [PMID: 29616000]
[http://dx.doi.org/10.1016/j.omtn.2018.03.012] [PMID: 29858081]
[http://dx.doi.org/10.1093/bioinformatics/bty827] [PMID: 30247625]
[http://dx.doi.org/10.1016/j.omtn.2017.03.006] [PMID: 28624191]
[http://dx.doi.org/10.1093/bioinformatics/bty508] [PMID: 29931187]
[http://dx.doi.org/10.7150/ijbs.24174] [PMID: 29989085]
[http://dx.doi.org/10.7150/ijbs.24616] [PMID: 29989083]
[http://dx.doi.org/10.1093/bioinformatics/btz015] [PMID: 30624619]
[http://dx.doi.org/10.1155/2013/530696] [PMID: 23762187]
[http://dx.doi.org/10.1155/2013/567529] [PMID: 24062796]
[http://dx.doi.org/10.1016/j.omtn.2019.04.019]
[PMID: 30590410]
[http://dx.doi.org/10.1093/bib/bby091] [PMID: 30239616]
[http://dx.doi.org/10.1093/bib/bby107] [PMID: 30383239]
[http://dx.doi.org/10.1371/journal.pone.0106542] [PMID: 25222008]
[http://dx.doi.org/10.3389/fphar.2018.00276] [PMID: 29636690]
[http://dx.doi.org/10.1021/acs.jproteome.8b00148] [PMID: 29893128]
[http://dx.doi.org/10.3389/fgene.2019.00129] [PMID: 30891059]
[http://dx.doi.org/10.1371/journal.pone.0200283] [PMID: 30312302]
[http://dx.doi.org/10.3390/molecules23071667] [PMID: 29987232]
[http://dx.doi.org/10.1039/C7MB00491E] [PMID: 28990628]
[http://dx.doi.org/10.1093/bioinformatics/btx479] [PMID: 28961687]
[http://dx.doi.org/10.3389/fimmu.2018.01695] [PMID: 30100904]
[http://dx.doi.org/10.3389/fimmu.2018.01783] [PMID: 30108593]
[http://dx.doi.org/10.1016/j.csbj.2018.10.007] [PMID: 30425802]
[http://dx.doi.org/10.3390/cells8020095] [PMID: 30696115]
[http://dx.doi.org/10.2174/1389200219666181031105916] [PMID: 30378494]
[http://dx.doi.org/10.1016/j.ab.2013.05.024] [PMID: 23756733]
[http://dx.doi.org/10.1093/bioinformatics/bty140] [PMID: 29528364]
[http://dx.doi.org/10.1093/nar/gkv458] [PMID: 25958395]
[http://dx.doi.org/10.1021/ci400127q] [PMID: 24047419]