Abstract
Background: Immune reaction is the most important defense mechanism for destroying invading pathogens in our body, and the epitope is the position of the antigen–antibody interaction on pathogenic proteins.
Objective: The majority of epitopes are structural; however, the existing sequence-based predicting websites still have several methods to improve the predicting performance. Therefore, in this study, we used SVM as a machine learning tool to predict the epitope-based on protein sequences.
Methods: Firstly, we built five SVM models in the first layer according to five features, including binary composition, position-specific scoring matrix, secondary structure, accessible surface area, and association rule, and then chose the patterns that exhibited the best performance in each model. Secondly, using the confidence score of the first-layer models as the input value for the SVM model in the second layer, that SVM model was integrated into the first-layer SVM models for improving the predicting accuracy.
Results: The final prediction model was able to achieve up to 63% accuracy in predicting epitope results, and the predicting performance was better than that achieved by the existing predicting websites.
Conclusion: Finally, a case study using a two-subunit cytochrome c oxidase of Paracoccus denitrificans was tested, achieving an accuracy of up to 66%.
Keywords: Structural epitope, support vector machines, association rule, position-specific scoring matrix, immune, pathogens.
Graphical Abstract
[http://dx.doi.org/10.1073/pnas.93.1.7] [PMID: 8552677]
[http://dx.doi.org/10.1038/322747a0] [PMID: 2427953]
[http://dx.doi.org/10.1128/IAI.69.7.4329-4336.2001] [PMID: 11401970]
[http://dx.doi.org/10.1186/1745-7580-2-2] [PMID: 16635264]
[http://dx.doi.org/10.1002/jmr.893] [PMID: 18496882]
[http://dx.doi.org/10.1007/s00726-006-0485-9] [PMID: 17252308]
[http://dx.doi.org/10.1002/jmr.771] [PMID: 16598694]
[http://dx.doi.org/10.1186/1471-2164-11-S4-S21] [PMID: 21143805]
[http://dx.doi.org/10.1142/9781848162648_0011] [PMID: 19642274]
[http://dx.doi.org/10.1073/pnas.78.6.3824] [PMID: 6167991]
[http://dx.doi.org/10.1128/JVI.55.3.836-839.1985] [PMID: 2991600]
[http://dx.doi.org/10.1155/2014/689219] [PMID: 25045691]
[http://dx.doi.org/10.1371/journal.pone.0043575] [PMID: 22927994]
[http://dx.doi.org/10.1016/j.compbiolchem.2014.02.002] [PMID: 24607818]
[http://dx.doi.org/10.1186/1471-2105-16-S18-S12] [PMID: 26681157]
[http://dx.doi.org/10.1002/prot.21387] [PMID: 17427229]
[http://dx.doi.org/10.1016/j.jim.2015.09.006] [PMID: 26455801]
[http://dx.doi.org/10.1002/prot.21078] [PMID: 16894596]
[http://dx.doi.org/10.1093/bioinformatics/btu281] [PMID: 24931993]
[http://dx.doi.org/10.1110/ps.062405906] [PMID: 17001032]
[http://dx.doi.org/10.1186/1471-2105-12-341] [PMID: 21846404]
[PMID: 15980448]
[http://dx.doi.org/10.1186/1745-7580-6-6] [PMID: 20961417]
[http://dx.doi.org/10.1093/nar/gkp417] [PMID: 19465377]
[http://dx.doi.org/10.1038/nbt0308-274] [PMID: 18327232]
[http://dx.doi.org/10.2174/1574893611666151119221435]
[http://dx.doi.org/10.1145/1961189.1961199]
[http://dx.doi.org/10.1093/nar/gkj157] [PMID: 16381837]
[http://dx.doi.org/10.1093/nar/25.17.3389] [PMID: 9254694]
[http://dx.doi.org/10.1186/1472-6807-9-51] [PMID: 19646261]
[http://dx.doi.org/10.1145/1656274.1656278]
[http://dx.doi.org/10.1073/pnas.94.20.10547] [PMID: 9380672]