Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm

Bi-Qing      Li; Yu-Hang      Zhang; Mei-Ling      Jin; Tao      Huang; Yu-Dong      Cai

doi:10.2174/1574893611666160711162006

Abstract

Background: As a crucial component of the entire protein-protein interaction (PPI) network, protein-peptide interactions are ubiquitous in living cells. These interactions play important roles in signaling transduction and regulation. Compared with laborious and time-consuming experimental approaches, predicting protein-peptide interactions with effective computational methods could be convenient and rapid.

Method: This study proposed a novel method for the prediction of interactions between proteins and peptides using various features extracted from both proteins and peptides. The traditional amino acid composition as well as pseudo-amino acid composition and features derived from 205 domains were utilized to represent a protein-peptide interaction. The predictor was constructed based on four different machine learning algorithms including SMO (sequential minimal optimization), IB1 (nearest neighbor algorithm), dagging, and random forest (RF). All features were analyzed by some feature selection technologies, such as the maximum relevance minimum redundancy method and the incremental feature selection method, to extract optimal features. Additionally, an optimal predictor based on IB1 was constructed according to the extracted optimal features.

Results: MCC values of 0.4436 for the cross-validation test of the training set and 0.4444 for the independent test set were obtained with the IB1 algorithm. Different encoding methods were compared. The domain-based method outperformed the pseudo-amino acid composition method. An optimal feature set of 230 features was selected, which contributed most to the prediction of the protein-peptide pairs.

Conclusion: Several important domains related to some features in the optimal feature set were deemed to play key roles in determining the protein-peptide interactions.

Keywords: Protein-peptide interactions, maximum relevance minimum redundancy, incremental feature selection, functional domain composition, pseudo-amino acid composition.

« Previous Next »

Graphical Abstract

Rights & Permissions Print Cite

Article Metrics

55

4

Journal Information

For Authors

For Editors

For Reviewers

Explore Articles

Open Access

Open Access Articles

For Visitors

DOI https://dx.doi.org/10.2174/1574893611666160711162006	Print ISSN 1574-8936
Publisher Name Bentham Science Publisher	Online ISSN 2212-392X

Current Bioinformatics

Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm

Abstract Play Pause

Graphical Abstract

Related Journals

Related Books

Abstract