Abstract
Aims: Post-Translational Modifications (PTMs), which include more than 450 types, can be regarded as the fundamental cellular regulation.
Background: Recently, experiments demonstrated that the lysine malonylation modification is a significant process in several organisms and cells. Meanwhile, malonylation plays an important role in the regulation of protein subcellular localization, stability, translocation to lipid rafts and many other protein functions.
Objective: Identification of malonylation will contribute to understanding the molecular mechanism in the field of biology. Nevertheless, several existing experimental approaches, which can hardly meet the need of the high speed data generation, are expensive and time-consuming. Moreover, some machine learning methods can hardly meet the high-accuracy need in this issue.
Methods: In this study, we proposed a method, named MSIT that means malonylation sites identification tree, utilized the amino acid residues and profile information to identify the lysine malonylation sites with the tree structural neural network in the peptides sequence level.
Results: The proposed algorithm can get 0.8699 of F1 score and 89.34% in true positive ratio in E. coli. MSIT outperformed existing malonylation site identification methods and features on different species datasets.
Conclusion: Based on these measures, it can be demonstrated that MSIT will be helpful in identifying candidate malonylation sites.
Keywords: Post translational modification, modification sites identification, flexible neural tree, malonylation, subcellular, localization, amino acid.
Graphical Abstract
[http://dx.doi.org/10.1038/nmeth1100] [PMID: 17901869]
[http://dx.doi.org/10.1002/anie.200501023] [PMID: 16267872]
[http://dx.doi.org/10.1586/14737159.3.4.411] [PMID: 12877381]
[http://dx.doi.org/10.1016/j.clinbiochem.2010.03.015] [PMID: 20381482]
[http://dx.doi.org/10.1038/nmeth731] [PMID: 15789031]
[http://dx.doi.org/10.1016/S1367-5931(98)80095-1] [PMID: 9818190]
[http://dx.doi.org/10.1111/j.1742-4658.2010.07992.x] [PMID: 21205204]
[http://dx.doi.org/10.1074/jbc.M213066200] [PMID: 12654918]
[http://dx.doi.org/10.1038/sj.leu.2401657] [PMID: 10637471]
[http://dx.doi.org/10.1016/j.tibs.2006.12.004] [PMID: 17208443]
[http://dx.doi.org/10.1016/j.bbapap.2007.07.008] [PMID: 17728195]
[PMID: 3276689]
[http://dx.doi.org/10.1038/47412] [PMID: 10638745]
[http://dx.doi.org/10.1073/pnas.93.18.9821] [PMID: 8790415]
[http://dx.doi.org/10.1146/annurev.immunol.18.1.621] [PMID: 10837071]
[http://dx.doi.org/10.1101/sqb.2007.72.024] [PMID: 18419308]
[http://dx.doi.org/10.1038/nrm1761] [PMID: 16261189]
[http://dx.doi.org/10.1074/mcp.M111.012658]
[http://dx.doi.org/10.1038/nchembio.495] [PMID: 21151122]
[http://dx.doi.org/10.1016/j.cell.2007.02.005] [PMID: 17320507]
[http://dx.doi.org/10.1016/j.molmed.2010.09.002] [PMID: 20932800]
[http://dx.doi.org/10.1016/j.cbpa.2006.11.022] [PMID: 17157550]
[http://dx.doi.org/10.1074/jbc.R114.595439] [PMID: 25336654]
[http://dx.doi.org/10.1016/j.cell.2008.05.020] [PMID: 18510935 ]
[http://dx.doi.org/10.1038/nrm2298] [PMID: 18037899]
[http://dx.doi.org/10.1016/j.ceb.2007.04.002] [PMID: 17466502]
[http://dx.doi.org/10.1038/nature04815] [PMID: 16728976]
[http://dx.doi.org/10.1016/j.cell.2005.03.036] [PMID: 15960974]
[http://dx.doi.org/10.1016/S0014-5793(01)03309-9] [PMID: 11911891]
[http://dx.doi.org/10.1515/revneuro-2013-0008] [PMID: 23729618]
[http://dx.doi.org/10.1126/science.1063127] [PMID: 11498575]
[http://dx.doi.org/10.1093/abbs/gmr100] [PMID: 22194010]
[http://dx.doi.org/10.1016/j.tig.2003.09.007] [PMID: 14585615]
[http://dx.doi.org/10.1038/ng1966] [PMID: 17277777]
[http://dx.doi.org/10.1038/nrm3293] [PMID: 22395773]
[http://dx.doi.org/10.1091/mbc.e05-01-0033] [PMID: 16079181]
[http://dx.doi.org/10.1016/j.cell.2006.06.057] [PMID: 16959573]
[http://dx.doi.org/10.1038/nature06736] [PMID: 18337721]
[http://dx.doi.org/10.1074/jbc.M413296200] [PMID: 15795229]
[http://dx.doi.org/10.1016/j.str.2007.02.002] [PMID: 17355872]
[http://dx.doi.org/10.1002/anie.201300252] [PMID: 23533089]
[http://dx.doi.org/10.1074/mcp.R114.046664] [PMID: 25717114]
[PMID: 28113983]
[http://dx.doi.org/10.1109/TCBB.2013.10] [PMID: 23929869]
[http://dx.doi.org/10.1109/TCBB.2011.79] [PMID: 21519114]
[http://dx.doi.org/10.1109/TCBB.2015.2476790] [PMID: 26415208]
[http://dx.doi.org/10.1074/mcp.M110.001388] [PMID: 20702892]
[http://dx.doi.org/10.6026/97320630006204] [PMID: 21738315]
[http://dx.doi.org/10.1093/bioinformatics/btu852] [PMID: 25568279]
[http://dx.doi.org/10.1093/nar/27.1.368] [PMID: 9847231]
[http://dx.doi.org/10.1038/npre.2010.5092.1]
[http://dx.doi.org/10.1002/humu.20021] [PMID: 15108278]
[PMID: 9181472]
[PMID: 28025199]
[http://dx.doi.org/10.1038/srep38318] [PMID: 27910954]
[http://dx.doi.org/10.1093/bioinformatics/btu852] [PMID: 25568279]