Abstract
Background: As a new type of protein acylation modification, lysine glutarylation has been found to play a crucial role in metabolic processes and mitochondrial functions. To further explore the biological mechanisms and functions of glutarylation, it is significant to predict the potential glutarylation sites. In the existing glutarylation site predictors, experimentally verified glutarylation sites are treated as positive samples and non-verified lysine sites as the negative samples to train predictors. However, the non-verified lysine sites may contain some glutarylation sites which have not been experimentally identified yet.
Methods: In this study, experimentally verified glutarylation sites are treated as the positive samples, whereas the remaining non-verified lysine sites are treated as unlabeled samples. A bioinformatics tool named PUL-GLU was developed to identify glutarylation sites using a positive-unlabeled learning algorithm.
Results: Experimental results show that PUL-GLU significantly outperforms the current glutarylation site predictors. Therefore, PUL-GLU can be a powerful tool for accurate identification of protein glutarylation sites.
Conclusion: A user-friendly web-server for PUL-GLU is available at http://bioinform.cn/pul_glu/.
Keywords: Post-translational modification, glutarylation, support vector machine, positive-unlabeled learning, protein acylation, site predictors.
Graphical Abstract
[http://dx.doi.org/10.1074/mcp.M700021-MCP200] [PMID: 17267393]
[http://dx.doi.org/10.1016/j.cell.2011.08.008] [PMID: 21925322]
[http://dx.doi.org/10.1038/nchembio.495] [PMID: 21151122]
[http://dx.doi.org/10.1038/nrm3841] [PMID: 25053359]
[http://dx.doi.org/10.1038/nchembio.1497] [PMID: 24681537]
[http://dx.doi.org/10.1074/mcp.R114.046664] [PMID: 25717114]
[http://dx.doi.org/10.1016/j.cmet.2014.03.014] [PMID: 24703693]
[http://dx.doi.org/10.1021/acs.jproteome.5b00917] [PMID: 26903315]
[http://dx.doi.org/10.1016/j.ab.2018.04.005] [PMID: 29641975]
[http://dx.doi.org/10.1109/TNB.2018.2848673] [PMID: 29994125]
[http://dx.doi.org/10.1186/s12859-018-2394-9] [PMID: 30717647]
[http://dx.doi.org/10.1039/C9MO00028C] [PMID: 31025681]
[http://dx.doi.org/10.1093/bioinformatics/btl441] [PMID: 16945945]
[http://dx.doi.org/10.1109/TPAMI.2005.159] [PMID: 16119262]
[http://dx.doi.org/10.1021/acs.jproteome.9b00226] [PMID: 31267738]
[http://dx.doi.org/10.1016/j.ygeno.2019.02.006]
[http://dx.doi.org/10.1016/j.jtbi.2010.12.024] [PMID: 21168420]
[http://dx.doi.org/10.2174/0929867326666190507082559] [PMID: 31060481]
[http://dx.doi.org/10.2174/1568026619666191018100141]
[http://dx.doi.org/10.1093/bioinformatics/btl158] [PMID: 16731699]
[http://dx.doi.org/10.1073/pnas.0408677102] [PMID: 15851683]
[http://dx.doi.org/10.1093/nar/26.8.1974] [PMID: 9518491]
[http://dx.doi.org/10.1016/j.ab.2017.07.011] [PMID: 28709899]
[http://dx.doi.org/10.1016/j.gene.2018.04.055] [PMID: 29694908]
[http://dx.doi.org/10.1145/1961189.1961199]
[http://dx.doi.org/10.1145/775047.775083]
[http://dx.doi.org/10.1186/1471-2105-9-57] [PMID: 18221567]
[http://dx.doi.org/10.1186/1471-2105-11-228] [PMID: 20444264]
[http://dx.doi.org/10.1093/bioinformatics/bts504] [PMID: 22923290]
[http://dx.doi.org/10.1371/journal.pone.0097079] [PMID: 24816822]
[http://dx.doi.org/10.1186/s12859-019-2700-1] [PMID: 30841845]