Abstract
Background: Methylation is one of the most important post-translational modifications in the human body which usually arises on lysine amongthe most intensely modified residues. It performs a dynamic role in numerous biological procedures, such as regulation of gene expression, regulation of protein function and RNA processing. Therefore, to identify lysine methylation sites is an important challenge as some experimental procedures are time-consuming.
Objective: Herein, we propose a computational predictor named iMethylK-PseAAC to identify lysine methylation sites.
Methods: Firstly, we constructed feature vectors based on PseAAC using position and composition relative features and statistical moments. A neural network is trained based on the extracted features. The performance of the proposed method is then validated using cross-validation and jackknife testing.
Results: The objective evaluation of the predictor showed accuracy of 96.7% for self-consistency, 91.61% for 10-fold cross-validation and 93.42% for jackknife testing.
Conclusion: It is concluded that iMethylK-PseAAC outperforms the counterparts to identify lysine methylation sites such as iMethyl-PseACC, BPB-PPMS and PMeS.
Keywords: Methylation, lysine methylation, PseAAC, statistical moments, 5-steps rule, prediction.
Graphical Abstract
[http://dx.doi.org/10.1016/0006-291X(67)90533-5] [PMID: 6055181]
[http://dx.doi.org/10.2174/1568026615666150819110421] [PMID: 26286211]
[http://dx.doi.org/10.1016/0955-0674(93)90080-A] [PMID: 8129951]
[http://dx.doi.org/10.1016/j.ceb.2004.04.002] [PMID: 15145346]
[http://dx.doi.org/10.1210/er.2004-0008] [PMID: 15479858]
[http://dx.doi.org/10.1007/s00395-006-0592-5] [PMID: 16705470]
[http://dx.doi.org/10.1523/JNEUROSCI.3349-06.2006] [PMID: 17079667]
[http://dx.doi.org/10.1007/s00018-008-8605-1] [PMID: 19370393]
[http://dx.doi.org/10.1046/j.1432-1327.1999.00532.x] [PMID: 10406966]
[http://dx.doi.org/10.1038/nature04048] [PMID: 16121170]
[http://dx.doi.org/10.1371/journal.pone.0181966] [PMID: 28797096]
[http://dx.doi.org/10.1155/2016/8370132]
[http://dx.doi.org/10.1007/s00232-016-9937-7] [PMID: 27866233]
[http://dx.doi.org/10.1007/s11033-018-4391-5] [PMID: 30238411]
[http://dx.doi.org/10.1016/j.ab.2018.12.019] [PMID: 30593778]
[http://dx.doi.org/10.1007/s00521-013-1372-4]
[PMID: 30550863]
[http://dx.doi.org/10.1016/j.ab.2018.04.021] [PMID: 29704476]
[http://dx.doi.org/10.1007/s11033-018-4417-z] [PMID: 30311130]
[http://dx.doi.org/10.1016/j.jtbi.2019.02.007] [PMID: 30768975]
[http://dx.doi.org/10.2174/1381612825666181127101039] [PMID: 30479209]
[http://dx.doi.org/10.1016/j.jtbi.2016.02.020] [PMID: 26908349]
[http://dx.doi.org/10.1016/j.jmgm.2017.08.020] [PMID: 28886434]
[http://dx.doi.org/10.2174/1573406413666170515120507] [PMID: 28521678]
[http://dx.doi.org/10.2174/1573406413666170623082245] [PMID: 28641529]
[http://dx.doi.org/10.1038/s41598-018-36203-8] [PMID: 30560923]
[http://dx.doi.org/10.1016/j.jtbi.2018.10.046] [PMID: 30365947]
[http://dx.doi.org/10.1016/j.jtbi.2018.07.018] [PMID: 30031793]
[http://dx.doi.org/10.1016/j.ab.2018.09.002] [PMID: 30201554]
[http://dx.doi.org/10.1016/j.ab.2015.08.021] [PMID: 26314792]
[http://dx.doi.org/10.1016/j.omtn.2018.03.012] [PMID: 29858081]
[PMID: 28427142]
[http://dx.doi.org/10.1016/j.omtn.2017.03.006] [PMID: 28624191]
[PMID: 29360500]
[http://dx.doi.org/10.3390/ijms150610410] [PMID: 24918295]
[http://dx.doi.org/10.1016/j.ab.2015.12.009] [PMID: 26723495]
[http://dx.doi.org/10.1016/j.jtbi.2016.01.020] [PMID: 26807806]
[http://dx.doi.org/10.18632/oncotarget.9148] [PMID: 27153555]
[http://dx.doi.org/10.1093/bioinformatics/btw387] [PMID: 27354696]
[http://dx.doi.org/10.1016/j.gene.2018.04.055] [PMID: 29694908]
[http://dx.doi.org/10.1016/j.ab.2015.12.017] [PMID: 26748145]
[http://dx.doi.org/10.1002/minf.201600010] [PMID: 28488814]
[http://dx.doi.org/10.18632/oncotarget.17104] [PMID: 28476023]
[http://dx.doi.org/10.18632/oncotarget.10027] [PMID: 27322424]
[http://dx.doi.org/10.1093/bioinformatics/btw380] [PMID: 27334473]
[http://dx.doi.org/10.1080/07391102.2014.968875] [PMID: 25248923]
[http://dx.doi.org/10.18632/oncotarget.9987] [PMID: 27323404]
[http://dx.doi.org/10.1016/j.jtbi.2018.04.037] [PMID: 29727634]
[http://dx.doi.org/10.1093/protein/gzt042] [PMID: 24048266]
[http://dx.doi.org/10.1371/journal.pone.0055844] [PMID: 23409062]
[http://dx.doi.org/10.7717/peerj.171] [PMID: 24109555]
[http://dx.doi.org/10.2174/1573406413666170419150052] [PMID: 28425870]
[http://dx.doi.org/10.3390/ijms15057594] [PMID: 24857907]
[http://dx.doi.org/10.1371/journal.pone.0105018] [PMID: 25121969]
[http://dx.doi.org/10.3390/ijms150711204] [PMID: 24968264]
[http://dx.doi.org/10.1038/s41598-018-19491-y] [PMID: 29348418]
[PMID: 30593778]
[http://dx.doi.org/10.1093/bib/bby089]
[http://dx.doi.org/10.2174/1573406411666141229162834] [PMID: 25548930]
[http://dx.doi.org/10.1186/s12859-019-2700-1]
[PMID: 29107015]
[http://dx.doi.org/10.1093/bib/bby079] [PMID: 30351377]
[http://dx.doi.org/10.1371/journal.pone.0039308] [PMID: 22720092]
[http://dx.doi.org/10.1109/BIBE.2009.22]
[http://dx.doi.org/10.1371/journal.pone.0004920] [PMID: 19290060]
[PMID: 21544797]
[http://dx.doi.org/10.1371/journal.pone.0038772] [PMID: 22719939]
[http://dx.doi.org/10.1109/JBHI.2014.2298351] [PMID: 24808224]
[http://dx.doi.org/10.1039/c3ay41962b]
[http://dx.doi.org/10.1109/IJCNN.2017.7966050]
[http://dx.doi.org/10.1155/2014/875879]
[http://dx.doi.org/10.1155/2014/723595]
[http://dx.doi.org/10.1016/j.jtbi.2010.12.024] [PMID: 21168420]
[http://dx.doi.org/10.1093/protein/14.2.75] [PMID: 11297664]
[PMID: 30010789]
[http://dx.doi.org/10.1016/j.jtbi.2018.09.005] [PMID: 30201434]
[PMID: 29842950]
[http://dx.doi.org/10.1016/j.ygeno.2018.08.007] [PMID: 30179658]
[http://dx.doi.org/10.1016/j.jtbi.2018.07.032] [PMID: 30056084]
[http://dx.doi.org/10.1016/j.jtbi.2018.05.033] [PMID: 29870696]
[http://dx.doi.org/10.1016/j.ygeno.2018.09.004] [PMID: 30196077]
[http://dx.doi.org/10.18632/oncotarget.13758] [PMID: 27926534]
[http://dx.doi.org/10.2174/1381612824666181119145030] [PMID: 30451108]
[http://dx.doi.org/10.1016/j.ab.2013.05.024] [PMID: 23756733]
[http://dx.doi.org/10.1016/j.jtbi.2018.10.021] [PMID: 30312687]
[http://dx.doi.org/10.1093/nar/gku1019] [PMID: 25361964]
[http://dx.doi.org/10.1016/j.jtbi.2015.08.025]
[http://dx.doi.org/10.2174/0929867326666190507082559]
[http://dx.doi.org/10.1093/nar/gkh131]
[http://dx.doi.org/10.1093/nar/gkv1240] [PMID: 26578568]
[http://dx.doi.org/10.1093/bioinformatics/bts565] [PMID: 23060610]
[http://dx.doi.org/10.1101/gr.849004] [PMID: 15173120]
[http://dx.doi.org/10.1038/srep40242] [PMID: 28079126]
[http://dx.doi.org/10.18632/oncotarget.14524] [PMID: 28076851]
[http://dx.doi.org/10.1155/2013/530696]
[http://dx.doi.org/10.7717/peerj.171] [PMID: 24109555]
[http://dx.doi.org/10.1016/j.ygeno.2015.12.005] [PMID: 26724497]
[http://dx.doi.org/10.1002/minf.201600010] [PMID: 28488814]
[http://dx.doi.org/10.18632/oncotarget.9057] [PMID: 27147572]
[http://dx.doi.org/10.1093/nar/gku1019] [PMID: 25361964]
[http://dx.doi.org/10.1371/journal.pone.0105018] [PMID: 25121969]
[http://dx.doi.org/10.1016/j.jtbi.2016.01.020] [PMID: 26807806]
[http://dx.doi.org/10.18632/oncotarget.11975] [PMID: 27626500]
[http://dx.doi.org/10.18632/oncotarget.7815] [PMID: 26942877]
[http://dx.doi.org/10.1016/j.omtn.2017.04.008] [PMID: 28624202]
[http://dx.doi.org/10.1093/bioinformatics/btw539] [PMID: 27531102]
[http://dx.doi.org/10.18632/oncotarget.13758] [PMID: 27926534]
[http://dx.doi.org/10.1016/j.omtn.2017.03.006] [PMID: 28624191]
[http://dx.doi.org/10.1093/bioinformatics/btx579] [PMID: 28968797]
[http://dx.doi.org/10.1038/s41598-018-19491-y] [PMID: 29348418]
[http://dx.doi.org/10.1016/j.ygeno.2018.01.005] [PMID: 29360500]
[http://dx.doi.org/10.1093/bioinformatics/btw539] [PMID: 27531102]
[PMID: 29897410]
[http://dx.doi.org/10.1016/j.omtn.2017.04.008] [PMID: 28624202]
[http://dx.doi.org/10.1039/C1MB05420A] [PMID: 22134333]
[http://dx.doi.org/10.1039/c3mb25466f] [PMID: 23370050]
[http://dx.doi.org/10.1016/j.jtbi.2011.06.005] [PMID: 21684290]
[http://dx.doi.org/10.1016/j.ab.2013.01.019] [PMID: 23395824]
[http://dx.doi.org/10.1039/c3mb25555g] [PMID: 23536215]
[PMID: 28818512]
[http://dx.doi.org/10.1039/C7MB00267J] [PMID: 28702580]
[http://dx.doi.org/10.1016/j.gene.2017.07.036] [PMID: 28728979]
[http://dx.doi.org/10.1093/bioinformatics/btx711] [PMID: 29106451]
[http://dx.doi.org/10.1016/j.ygeno.2017.10.002] [PMID: 28989035]
[http://dx.doi.org/10.1093/bioinformatics/btx476] [PMID: 29036535]
[http://dx.doi.org/10.4236/ns.2017.99032]
[http://dx.doi.org/10.1093/bioinformatics/btx387] [PMID: 28172617]
[http://dx.doi.org/10.1039/c3mb25555g]
[http://dx.doi.org/10.32614/RJ-2016-042]
[http://dx.doi.org/10.1016/j.ygeno.2018.12.001] [PMID: 30529532]
[PMID: 30388198]
[http://dx.doi.org/10.1016/j.jtbi.2018.08.030] [PMID: 30138632]
[http://dx.doi.org/10.1016/j.jtbi.2018.01.023] [PMID: 29408627]
[http://dx.doi.org/10.1016/j.jmgm.2017.12.011] [PMID: 29331879]
[http://dx.doi.org/10.1038/s41598-018-20819-x] [PMID: 29402983]
[http://dx.doi.org/10.1093/bib/bby077] [PMID: 30184176]
[http://dx.doi.org/10.1093/bioinformatics/bty522] [PMID: 29947803]
[http://dx.doi.org/10.1016/j.jtbi.2018.02.008] [PMID: 29476832]
[http://dx.doi.org/10.1016/j.jtbi.2018.01.008] [PMID: 29337263]
[http://dx.doi.org/10.1016/0006-2952(94)90077-9]
[http://dx.doi.org/10.1042/bj1870829]
[http://dx.doi.org/10.4236/ns.2011.310111]
[http://dx.doi.org/10.2174/138920010791514261]
[http://dx.doi.org/10.1139/v81-107]
[http://dx.doi.org/10.1042/bj2220169]
[http://dx.doi.org/10.1016/j.jtbi.2011.06.006]
[http://dx.doi.org/10.1016/0301-4622(80)80002-0]
[http://dx.doi.org/10.1016/0301-4622(80)80003-2]
[http://dx.doi.org/10.4236/ns.2009.12011]
[http://dx.doi.org/10.1016/0301-4622(88)85002-6]
[http://dx.doi.org/10.1093/bioinformatics/btx479] [PMID: 28961687]
[http://dx.doi.org/10.2174/1568026617666170414145508] [PMID: 28413951]