Abstract
Background/Objective: Protein-protein interactions are essentials for most cellular processes and thus, unveiling how proteins interact with is a crucial question that can be better understood by recognizing which residues participate in the interaction. Although many computational approaches have been proposed to predict interface residues, their feature perspective and model learning ability are not enough to achieve ideal results. So, our objective is to improve the predictive performance under considering feature perspective and new learning algorithm.
Method: In this study, we proposed an ensemble deep convolutional neural network, which explores the context and positional context of consecutive residues within a protein sub-sequence. Specifically, unlike the feature view of previous methods, ConvsPPIS uses evolutionary, physicochemical, and structural protein characteristics to construct their own feature graph respectively. After that, three independent deep convolutional neural networks are trained on each type of feature graph for learning the underlying pattern in sub-sequence. Lastly, we integrated those three deep networks into an ensemble predictor with leveraging complementary information of those features to predict potential interface residues.
Results: Some comparative experiments have conducted through 10-fold cross-validation. The results indicated that ConvsPPIS achieved superior performance on DBv5-Sel dataset with an accuracy of 88%. Additional experiments on CAPRI-Alone dataset demonstrated ConvsPPIS has also better prediction performance.
Conclusion: The ConvsPPIS method provided a new perspective to capture protein feature expression for identifying protein-protein interaction sites. The results proved the superiority of this method.
Keywords: Feature graph, positional context, protein complex, interface prediction, convolution neural network, ensemble learning.
Graphical Abstract
[http://dx.doi.org/10.1093/bioinformatics/btx005] [PMID: 28073761]
[http://dx.doi.org/10.1002/prot.24479] [PMID: 24243399]
[http://dx.doi.org/10.1006/jmbi.2000.4092] [PMID: 10993732]
[http://dx.doi.org/10.1016/S0014-5793(03)00456-3] [PMID: 12782323]
[http://dx.doi.org/10.1093/bioinformatics/bth920]
[http://dx.doi.org/10.1093/bioinformatics/bti340] [PMID: 15728113]
[http://dx.doi.org/10.1093/bioinformatics/btl303] [PMID: 17237081]
[http://dx.doi.org/10.1006/jmbi.1997.1233] [PMID: 9299343]
[http://dx.doi.org/10.1371/journal.pcbi.1000278] [PMID: 19180183]
[http://dx.doi.org/10.1093/bioinformatics/btx044] [PMID: 28130235]
[http://dx.doi.org/10.1371/journal.pone.0043927] [PMID: 22937126]
[http://dx.doi.org/10.1002/prot.1099] [PMID: 11455607]
[http://dx.doi.org/10.1046/j.1432-1033.2002.02767.x] [PMID: 11874449]
[http://dx.doi.org/10.1186/1471-2105-10-426] [PMID: 20015386]
[http://dx.doi.org/10.1093/bioinformatics/btp039] [PMID: 19153136]
[http://dx.doi.org/10.1093/bioinformatics/btq302] [PMID: 20529890]
[http://dx.doi.org/10.1093/bioinformatics/btx585] [PMID: 28968673]
[http://dx.doi.org/10.1186/1748-7188-4-13] [PMID: 19849839]
[http://dx.doi.org/10.1016/j.jtbi.2014.01.028] [PMID: 24486250]
[http://dx.doi.org/10.1016/j.neucom.2016.02.022]
[http://dx.doi.org/10.1093/bioinformatics/bti242] [PMID: 15613384]
[http://dx.doi.org/10.1186/1471-2105-15-277] [PMID: 25124108]
[http://dx.doi.org/10.1093/bioinformatics/btl660] [PMID: 17234636]
[http://dx.doi.org/10.1002/prot.20514] [PMID: 16080151]
[http://dx.doi.org/10.1038/nbt.3300] [PMID: 26213851]
[http://dx.doi.org/10.1101/gr.200535.115] [PMID: 27197224]
[http://dx.doi.org/10.1016/j.jmb.2015.07.016] [PMID: 26231283]
[http://dx.doi.org/10.2174/138920308785132712] [PMID: 18691126]
[http://dx.doi.org/10.1186/1472-6807-8-21] [PMID: 18400099]
[http://dx.doi.org/10.1073/pnas.84.13.4355] [PMID: 3474607]
[http://dx.doi.org/10.1093/nar/25.17.3389] [PMID: 9254694]
[http://dx.doi.org/10.1093/protein/gzg072] [PMID: 12968073]
[http://dx.doi.org/10.1186/s13015-015-0033-9] [PMID: 25713596]
[http://dx.doi.org/10.1007/BF01025492]
[http://dx.doi.org/10.1561/9781601982957]
[http://dx.doi.org/10.1016/j.aci.2018.08.003]
[http://dx.doi.org/10.1002/prot.21248] [PMID: 17152079]
[http://dx.doi.org/10.1186/1471-2105-9-553] [PMID: 19102736]
[http://dx.doi.org/10.1093/bioinformatics/btm434] [PMID: 17895276]