Abstract
Protein-protein interactions (PPIs) are the physical connections between two or more proteins via electrostatic forces or hydrophobic effects. Identification of the PPIs is pivotal, which contributes to many biological processes including protein function, disease incidence, and therapy design. The experimental identification of PPIs via high-throughput technology is time-consuming and expensive. Bioinformatics approaches are expected to solve such restrictions. In this review, our main goal is to provide an inclusive view of the existing sequence-based computational prediction of PPIs. Initially, we briefly introduce the currently available PPI databases and then review the state-of-the-art bioinformatics approaches, working principles, and their performances. Finally, we discuss the caveats and future perspective of the next generation algorithms for the prediction of PPIs.
Keywords: Protein-protein interactions, PPIs database, sequence features, feature selection, machine learning, bioinformatics.
Graphical Abstract
[http://dx.doi.org/10.1371/journal.pcbi.1000807] [PMID: 20589078]
[http://dx.doi.org/10.1016/j.artmed.2019.04.001] [PMID: 31164203]
[http://dx.doi.org/10.1186/s12859-018-2105-6] [PMID: 29510668]
[http://dx.doi.org/10.1039/C7MB00434F] [PMID: 29028058]
[http://dx.doi.org/10.3892/etm.2017.5185] [PMID: 29201163]
[http://dx.doi.org/10.1093/bioinformatics/btt401] [PMID: 23842807]
[http://dx.doi.org/10.2174/09298665113209990050] [PMID: 23855673]
[http://dx.doi.org/10.1039/C1MB05340J] [PMID: 22159132]
[http://dx.doi.org/10.1021/acs.jproteome.9b00074] [PMID: 30983371]
[http://dx.doi.org/10.1080/10245332.2017.1409947] [PMID: 29189103]
[http://dx.doi.org/10.1371/journal.pone.0144163] [PMID: 26641660]
[http://dx.doi.org/10.1093/nar/gky973] [PMID: 30357367]
[http://dx.doi.org/10.1093/nar/gkp914] [PMID: 19884131]
[http://dx.doi.org/10.1074/mcp.R110.000265] [PMID: 20445003]
[http://dx.doi.org/10.1039/C2MB25325A] [PMID: 23104128]
[http://dx.doi.org/10.1146/annurev-biochem-060614-034142] [PMID: 25494300]
[http://dx.doi.org/10.1007/978-1-4939-3591-8_15] [PMID: 26965266]
[http://dx.doi.org/10.1007/978-1-4939-2425-7_32] [PMID: 25859971]
[http://dx.doi.org/10.1089/cmb.2009.0165]
[http://dx.doi.org/10.1128/mSystems.00303-18] [PMID: 30984872]
[http://dx.doi.org/10.1142/S0219720018500257] [PMID: 30400756]
[http://dx.doi.org/10.1016/j.jtbi.2017.08.009] [PMID: 28802824]
[http://dx.doi.org/10.4238/gmr.15028365]
[http://dx.doi.org/10.1186/s13015-015-0033-9] [PMID: 25713596]
[http://dx.doi.org/10.1186/s12859-016-1422-x] [PMID: 28049415]
[http://dx.doi.org/10.1038/srep17004] [PMID: 26608097]
[http://dx.doi.org/10.1007/s10295-014-1462-z] [PMID: 24879479]
[http://dx.doi.org/10.1038/sj.onc.1209458] [PMID: 16518412]
[http://dx.doi.org/10.1002/cfg.365] [PMID: 18629034]
[http://dx.doi.org/10.1371/journal.pone.0005815] [PMID: 19503833]
[http://dx.doi.org/10.1016/j.ab.2009.05.028] [PMID: 19464993]
[http://dx.doi.org/10.1007/s00726-014-1900-2] [PMID: 25540052]
[http://dx.doi.org/10.1109/TCBB.2017.2701824] [PMID: 28504946]
[http://dx.doi.org/10.1109/embc.2018.8513476]
[http://dx.doi.org/10.1016/j.jtbi.2018.06.026] [PMID: 29981337]
[http://dx.doi.org/10.1111/tpj.13874]
[http://dx.doi.org/10.1002/prot.24066] [PMID: 22411607]
[http://dx.doi.org/10.1385/1-59259-816-1:271] [PMID: 15173623]
[http://dx.doi.org/10.1039/c0mb00038h] [PMID: 20714642]
[http://dx.doi.org/10.1186/1471-2105-8-199] [PMID: 17567909]
[http://dx.doi.org/10.1093/bioinformatics/btm208] [PMID: 17646292]
[http://dx.doi.org/10.1093/bioinformatics/btl079] [PMID: 16522669]
[http://dx.doi.org/10.1074/jbc.M007124200] [PMID: 11024056]
[http://dx.doi.org/10.1101/gr.1774904] [PMID: 15173116]
[http://dx.doi.org/10.1002/jcc.25780] [PMID: 30768790]
[http://dx.doi.org/10.1177/1176934319844522] [PMID: 31080346]
[http://dx.doi.org/10.1186/s12859-017-1700-2] [PMID: 28545462]
[http://dx.doi.org/10.2174/092986610789909403] [PMID: 20214637]
[http://dx.doi.org/10.1186/s12859-016-1035-4] [PMID: 27112932]
[http://dx.doi.org/10.1093/bioinformatics/btv737] [PMID: 26677965]
[http://dx.doi.org/10.1093/bioinformatics/btv077] [PMID: 25657331]
[http://dx.doi.org/10.1016/j.ygeno.2013.05.006] [PMID: 23747746]
[http://dx.doi.org/10.1016/j.ygeno.2014.10.006] [PMID: 25458812]
[http://dx.doi.org/10.1016/j.jmb.2004.02.040] [PMID: 15050833]
[http://dx.doi.org/10.1093/bib/bbx123] [PMID: 29028906]
[http://dx.doi.org/10.1093/database/baz005]
[http://dx.doi.org/10.1093/nar/gkw985] [PMID: 27794551]
[http://dx.doi.org/10.1104/pp.15.01821] [PMID: 26620522]
[http://dx.doi.org/10.1093/nar/gky1079] [PMID: 30476227]
[http://dx.doi.org/10.1093/nar/gkw1102] [PMID: 27980099]
[http://dx.doi.org/10.1093/nar/30.1.303] [PMID: 11752321]
[http://dx.doi.org/10.1093/bioinformatics/bty573] [PMID: 30423091]
[http://dx.doi.org/10.1093/bioinformatics/bts565] [PMID: 23060610]
[http://dx.doi.org/10.1186/s12859-019-2907-1] [PMID: 31182027]
[http://dx.doi.org/10.1109/TNB.2018.2797696] [PMID: 29570075]
[http://dx.doi.org/10.1016/j.ymeth.2016.07.018] [PMID: 27476008]
[http://dx.doi.org/10.1186/s12859-018-2525-3] [PMID: 30598096]
[http://dx.doi.org/10.1186/s12859-017-1871-x] [PMID: 29141584]
[http://dx.doi.org/10.1186/1756-0500-3-145] [PMID: 20500905]
[http://dx.doi.org/10.1016/j.jmb.2019.02.017] [PMID: 30796987]
[http://dx.doi.org/10.3389/fgene.2020.00018] [PMID: 32117437]
[http://dx.doi.org/10.1186/1471-2105-15-213] [PMID: 24953126]
[http://dx.doi.org/10.2174/0929866527666200610141258] [PMID: 32520672]
[http://dx.doi.org/10.1016/j.compbiolchem.2020.107238] [PMID: 32114285]
[http://dx.doi.org/10.1093/nar/25.17.3389] [PMID: 9254694]
[http://dx.doi.org/10.1109/ICIIBMS.2017.8279749]
[http://dx.doi.org/10.3390/ijms17091396] [PMID: 27571061]
[http://dx.doi.org/10.1109/TCYB.2016.2524994] [PMID: 28113829]
[PMID: 17998252]
[http://dx.doi.org/10.1093/nar/28.1.374] [PMID: 10592278]
[http://dx.doi.org/10.1093/bioinformatics/btz587] [PMID: 31350874]
[http://dx.doi.org/10.1155/2019/5238406] [PMID: 31531123]
[http://dx.doi.org/10.1093/bioinformatics/17.suppl_1.S296] [PMID: 11473021]
[http://dx.doi.org/10.1186/1752-0509-5-S1-S8] [PMID: 21689483]
[http://dx.doi.org/10.1371/journal.pcbi.1005717] [PMID: 28846689]
[http://dx.doi.org/10.1371/journal.pone.0129635] [PMID: 26080082]
[http://dx.doi.org/10.1093/bib/bby124] [PMID: 30649170]
[http://dx.doi.org/10.1016/j.compbiolchem.2019.05.008] [PMID: 31151025]
[http://dx.doi.org/10.3390/ijms20225743] [PMID: 31731751]
[http://dx.doi.org/10.4155/fmc-2016-0188] [PMID: 28211294]
[http://dx.doi.org/10.1021/acs.jproteome.8b00148] [PMID: 29893128]
[http://dx.doi.org/10.3389/fimmu.2018.01783] [PMID: 30108593]
[http://dx.doi.org/10.3390/ijms20081964] [PMID: 31013619]
[PMID: 30590410]
[http://dx.doi.org/10.2147/IJN.S140875] [PMID: 28894368]
[http://dx.doi.org/10.1039/C5MB00853K] [PMID: 26739209]
[http://dx.doi.org/10.1093/bioinformatics/btaa160] [PMID: 32145017]
[http://dx.doi.org/10.1007/s11103-020-00988-y] [PMID: 32140819]
[PMID: 31805335]
[http://dx.doi.org/10.1002/1873-3468.13536] [PMID: 31297788]
[http://dx.doi.org/10.3389/fgene.2019.00129] [PMID: 30891059]
[http://dx.doi.org/10.1038/s41598-019-44548-x] [PMID: 31164681]
[http://dx.doi.org/10.3390/cells8020095] [PMID: 30696115]
[http://dx.doi.org/10.3390/molecules23071667] [PMID: 29987232]
[http://dx.doi.org/10.1039/C7MB00491E] [PMID: 28990628]
[http://dx.doi.org/10.2174/0929866525666180905110619] [PMID: 30182830]
[http://dx.doi.org/10.1093/bioinformatics/bty1047] [PMID: 30590410]
[http://dx.doi.org/10.1371/journal.pone.0200283] [PMID: 30312302]
[http://dx.doi.org/10.4155/fmc-2017-0300] [PMID: 30039980]
[http://dx.doi.org/10.1186/s13321-016-0185-8] [PMID: 28053671]
[PMID: 26309399]
[http://dx.doi.org/10.1016/j.neucom.2019.05.013]
[http://dx.doi.org/10.1002/minf.201900130] [PMID: 31908150]
[http://dx.doi.org/10.3389/fmicb.2020.00236] [PMID: 32140149]
[http://dx.doi.org/10.2174/1389203721666200117171403] [PMID: 31957610]
[http://dx.doi.org/10.1016/j.omtn.2019.04.019] [PMID: 31146255]
[http://dx.doi.org/10.1093/bioinformatics/btv160] [PMID: 25788620]
[http://dx.doi.org/10.1093/bioinformatics/19.1.161] [PMID: 12499311]
[http://dx.doi.org/10.1186/s12859-019-3268-5] [PMID: 31874626]
[http://dx.doi.org/10.1002/prot.340180402] [PMID: 8208723]
[http://dx.doi.org/10.1016/j.omtn.2019.08.022] [PMID: 31581051]
[http://dx.doi.org/10.1016/j.jtbi.2017.09.022] [PMID: 28943403]
[http://dx.doi.org/10.1371/journal.pone.0072368] [PMID: 24019868]
[http://dx.doi.org/10.1038/s41598-017-14945-1] [PMID: 29097781]
[http://dx.doi.org/10.1038/s41419-017-0003-x] [PMID: 29305594]
[http://dx.doi.org/10.1093/bioinformatics/btz721] [PMID: 31566664]
[http://dx.doi.org/10.1016/j.csbj.2019.06.024] [PMID: 31372196]
[http://dx.doi.org/10.1002/med.21658] [PMID: 31922268]
[http://dx.doi.org/10.3389/fphar.2018.00276] [PMID: 29636690]