Abstract
Background: Since December 2019, the emergence of severe acute respiratory syndrome coronavirus 2, which gave rise to coronavirus disease 2019 (COVID-19), has considerably impacted global health. The identification of effective anticoronavirus peptides (ACVPs) and the establishment of robust data storage methods are critical in the fight against COVID-19. Traditional wet-lab peptide discovery approaches are timeconsuming and labor-intensive. With advancements in computer technology and bioinformatics, machine learning has gained prominence in the extraction of functional peptides from extensive datasets.
Methods: In this study, we comprehensively review data resources and predictors related to ACVPs published over the past two decades. In addition, we analyze the influence of various factors on model performance.
Results: We have reviewed nine ACVP-containing databases, which integrate detailed information on protein fragments effective against coronaviruses, providing crucial references for the development of antiviral drugs and vaccines. Additionally, we have assessed 15 peptide predictors for antiviral or specifically anticoronavirus activity. These predictors employ computational models to swiftly screen potential antiviral candidates, offering an efficient pathway for drug development.
Conclusion: Our study provides conclusive results and insights into the performance of different computational methods, and sheds light on the future trajectory of bioinformatics tools for ACVPs. This work offers a representative overview of contributions to the field, with an emphasis on the crucial role of ACVPs in combating COVID-19.
[http://dx.doi.org/10.3390/covid2070063]
[http://dx.doi.org/10.1016/j.bj.2020.04.007] [PMID: 32387617]
[http://dx.doi.org/10.1038/s41577-020-0311-8] [PMID: 32346093]
[http://dx.doi.org/10.1016/S0140-6736(03)13077-2] [PMID: 12711465]
[http://dx.doi.org/10.1016/S0140-6736(15)60454-8] [PMID: 26049252]
[http://dx.doi.org/10.1038/s41586-020-2012-7] [PMID: 32015507]
[http://dx.doi.org/10.1093/bib/bbz107] [PMID: 32978618]
[http://dx.doi.org/10.3390/medsci9020040] [PMID: 34199617]
[http://dx.doi.org/10.14348/molcells.2021.0026] [PMID: 34059561]
[http://dx.doi.org/10.1007/s11739-021-02840-7] [PMID: 34637082]
[http://dx.doi.org/10.1056/NEJMoa2034577] [PMID: 33301246]
[http://dx.doi.org/10.1056/NEJMoa2035389] [PMID: 33378609]
[http://dx.doi.org/10.1016/S0140-6736(20)32661-1] [PMID: 33306989]
[http://dx.doi.org/10.1056/NEJMoa2101544] [PMID: 33882225]
[http://dx.doi.org/10.1056/NEJMc2108829] [PMID: 34260834]
[http://dx.doi.org/10.1002/jmv.25593] [PMID: 31502669]
[http://dx.doi.org/10.1186/s12929-017-0328-x] [PMID: 28320393]
[http://dx.doi.org/10.1111/cbdd.12055] [PMID: 23253135]
[http://dx.doi.org/10.1038/s41597-022-01394-3] [PMID: 35697698]
[http://dx.doi.org/10.1093/nar/gkh025] [PMID: 14681488]
[http://dx.doi.org/10.1093/nar/gkn823] [PMID: 18957441]
[http://dx.doi.org/10.1093/nar/gkv1278] [PMID: 26602694]
[http://dx.doi.org/10.1093/nar/gkp1021] [PMID: 19923233]
[http://dx.doi.org/10.1093/nar/gkt1157] [PMID: 24265220]
[http://dx.doi.org/10.1093/nar/gkv1051] [PMID: 26467475]
[http://dx.doi.org/10.1093/nar/gkac933] [PMID: 36370097]
[http://dx.doi.org/10.1093/nar/gks450] [PMID: 22638580]
[http://dx.doi.org/10.1371/journal.pone.0066557] [PMID: 23825543]
[http://dx.doi.org/10.1111/1574-6968.12489] [PMID: 24888447]
[http://dx.doi.org/10.1093/nar/gkv1174] [PMID: 26578581]
[http://dx.doi.org/10.1093/nar/gkaa991] [PMID: 33151284]
[http://dx.doi.org/10.1093/nar/gkt1191] [PMID: 24285301]
[http://dx.doi.org/10.1038/srep24482] [PMID: 27075512]
[http://dx.doi.org/10.1038/s41597-019-0154-y] [PMID: 31409791]
[http://dx.doi.org/10.1093/nar/gkab651] [PMID: 34390348]
[http://dx.doi.org/10.1093/nar/gky1030] [PMID: 30380085]
[http://dx.doi.org/10.1093/nar/gkab1080] [PMID: 34850155]
[http://dx.doi.org/10.1093/bib/bbab258] [PMID: 34297817]
[http://dx.doi.org/10.1093/bib/bbac265] [PMID: 35772910]
[http://dx.doi.org/10.1371/journal.pone.0070166] [PMID: 23940542]
[http://dx.doi.org/10.1016/j.compbiomed.2019.02.011] [PMID: 30802694]
[http://dx.doi.org/10.3390/ijms20225743] [PMID: 31731751]
[http://dx.doi.org/10.1093/bioinformatics/btz246] [PMID: 30994882]
[http://dx.doi.org/10.1038/s41598-020-76161-8] [PMID: 33159146]
[http://dx.doi.org/10.1109/JBHI.2020.2977091] [PMID: 32142462]
[http://dx.doi.org/10.1093/bioinformatics/btaa275] [PMID: 32348463]
[http://dx.doi.org/10.1007/BF00994018]
[http://dx.doi.org/10.1023/A:1010933404324]
[http://dx.doi.org/10.1109/5.726791]
[http://dx.doi.org/10.1093/bib/bbaa423] [PMID: 33497434]
[http://dx.doi.org/10.1093/nar/gku892] [PMID: 25270878]
[http://dx.doi.org/10.3389/fmicb.2018.00323] [PMID: 29535692]
[http://dx.doi.org/10.1109/TCBB.2021.3064630] [PMID: 33687847]
[http://dx.doi.org/10.1093/bib/bbab263] [PMID: 34279599]
[http://dx.doi.org/10.1371/journal.pone.0054908] [PMID: 23359817]
[PMID: 2185863]
[http://dx.doi.org/10.1093/bib/bbab412] [PMID: 34595489]
[http://dx.doi.org/10.1109/JBHI.2021.3130825] [PMID: 34822333]
[http://dx.doi.org/10.1093/nar/gkv1114] [PMID: 26527728]
[http://dx.doi.org/10.1093/bioinformatics/btz260] [PMID: 30994884]
[http://dx.doi.org/10.1093/bib/bbab065] [PMID: 33784381]
[http://dx.doi.org/10.1093/bib/bbab242] [PMID: 34259329]
[http://dx.doi.org/10.1093/bib/bbab422] [PMID: 34670278]
[http://dx.doi.org/10.1093/bib/bbab439] [PMID: 34750606]
[http://dx.doi.org/10.1002/0471725293]
[http://dx.doi.org/10.1093/bioinformatics/btl158] [PMID: 16731699]
[http://dx.doi.org/10.1093/bioinformatics/bts565] [PMID: 23060610]
[http://dx.doi.org/10.1093/bioinformatics/btq003] [PMID: 20053844]
[http://dx.doi.org/10.4236/jbise.2013.64054]
[http://dx.doi.org/10.1016/j.asoc.2004.12.002]
[http://dx.doi.org/10.1093/bioinformatics/btv042] [PMID: 25619996]
[http://dx.doi.org/10.1214/aos/1013203451]
[http://dx.doi.org/10.1371/journal.pone.0136990] [PMID: 26335203]
[http://dx.doi.org/10.1186/1471-2105-8-263] [PMID: 17645800]
[http://dx.doi.org/10.1093/bioinformatics/bty451] [PMID: 29868903]
[http://dx.doi.org/10.3389/fphar.2018.00276] [PMID: 29636690]
[http://dx.doi.org/10.1021/acs.jproteome.7b00019] [PMID: 28436664]
[http://dx.doi.org/10.1371/journal.pone.0120066] [PMID: 25781990]
[http://dx.doi.org/10.1155/2017/5761517] [PMID: 29445741]
[http://dx.doi.org/10.1109/TPAMI.2005.159] [PMID: 16119262]
[http://dx.doi.org/10.1063/1.2830030] [PMID: 18282057]
[http://dx.doi.org/10.1109/TCBB.2017.2670558] [PMID: 28222000]
[http://dx.doi.org/10.1093/bib/bbz088] [PMID: 31729528]
[http://dx.doi.org/10.1016/j.neucom.2014.12.123]
[http://dx.doi.org/10.1186/1471-2105-11-S1-S19] [PMID: 20122190]
[http://dx.doi.org/10.1093/nar/gkr1147] [PMID: 22139916]
[http://dx.doi.org/10.3390/ijms20081964] [PMID: 31013619]
[http://dx.doi.org/10.1093/bioinformatics/bth261] [PMID: 15073010]