Abstract
Background: Knowledge of protein function is crucial for understanding biological processes. Experimental methods for determining protein function cannot keep pace with the growing amount of protein sequence and structure data.
Objective: To develop computational techniques for protein function prediction.
Methods: A support vector machine (SVM) model was built on residue interaction network features and motion mode information and used as the predictor. The contribution of each feature type to the prediction was then analyzed.
Results: An alignment-free method for classifying enzymes and non-enzymes was developed. No single feature dominates the prediction. The topological and information-theoretic residue interaction network features perform comparatively well. Combining the fast mode and the slow mode gives a better explanation of the classification results.
Conclusion: The proposed method can serve as a classifier for enzymes and non-enzymes.
Keywords: Protein descriptors, enzyme, motion mode, residue interaction network, support vector machines, protein function.
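To make the Methods more concrete, the following is a minimal sketch of how topological and information-theoretic descriptors might be computed from a residue interaction network. It is an illustration under stated assumptions, not the descriptor set used in this work: the network is built from C-alpha coordinates with a hypothetical 7.0 angstrom cutoff, and the three descriptors (mean degree, mean clustering coefficient, Shannon entropy of the degree distribution) and all function names are chosen here only as examples.

```python
import math
from collections import Counter
from itertools import combinations


def contact_network(coords, cutoff=7.0):
    """Build adjacency sets: residues i and j are linked when their
    C-alpha atoms lie within `cutoff` angstroms (assumed cutoff)."""
    adj = {i: set() for i in range(len(coords))}
    for i, j in combinations(range(len(coords)), 2):
        if math.dist(coords[i], coords[j]) <= cutoff:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def mean_degree(adj):
    """Topological descriptor: average number of contacts per residue."""
    return sum(len(nb) for nb in adj.values()) / len(adj)


def mean_clustering(adj):
    """Topological descriptor: average local clustering coefficient.
    Residues with fewer than two neighbours contribute zero."""
    total = 0.0
    for nb in adj.values():
        k = len(nb)
        if k < 2:
            continue
        links = sum(1 for u, v in combinations(nb, 2) if v in adj[u])
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj)


def degree_entropy(adj):
    """Information-theoretic descriptor: Shannon entropy (in bits)
    of the degree distribution of the network."""
    counts = Counter(len(nb) for nb in adj.values())
    n = len(adj)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def rin_features(coords, cutoff=7.0):
    """Return an alignment-free feature vector for one structure."""
    adj = contact_network(coords, cutoff)
    return [mean_degree(adj), mean_clustering(adj), degree_entropy(adj)]
```

Feature vectors of this kind, one per protein and labeled enzyme or non-enzyme, could then be passed to an SVM implementation such as `sklearn.svm.SVC`. Motion mode features (fast and slow modes) would have to be appended separately and are not covered by this sketch.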