Generic placeholder image

Protein & Peptide Letters

Editor-in-Chief

ISSN (Print): 0929-8665
ISSN (Online): 1875-5305

Prediction of Protein Subcellular Multi-Localization Based on the General form of Chou’s Pseudo Amino Acid Composition

Author(s): Li-Qi Li, Yuan Zhang, Ling-Yun Zou, Yue Zhou and Xiao-Qi Zheng

Volume 19, Issue 4, 2012

Page: [375 - 387] Pages: 13

DOI: 10.2174/092986612799789369

Price: $65

Abstract

Many proteins bear multi-locational characteristics, and this phenomenon is closely related to biological function. However, most of the existing methods can only deal with single-location proteins. Therefore, an automatic and reliable ensemble classifier for protein subcellular multi-localization is needed. We propose a new ensemble classifier combining the KNN (K-nearest neighbour) and SVM (support vector machine) algorithms to predict the subcellular localization of eukaryotic, Gram-negative bacterial and viral proteins based on the general form of Chou’s pseudo amino acid composition, i.e., GO (gene ontology) annotations, dipeptide composition and AmPseAAC (Amphiphilic pseudo amino acid composition). This ensemble classifier was developed by fusing many basic individual classifiers through a voting system. The overall prediction accuracies obtained by the KNN-SVM ensemble classifier are 95.22, 93.47 and 80.72% for the eukaryotic, Gram-negative bacterial and viral proteins, respectively. Our prediction accuracies are significantly higher than those by previous methods and reveal that our strategy better predicts subcellular locations of multi-location proteins.

Keywords: Amphiphilic pseudo amino acid composition, ensemble classifier, gene ontology, jackknife test, k-nearest neighbor, multiple subcellular localization, support vector machine

Next »

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy