
Current Protein & Peptide Science

ISSN (Print): 1389-2037
ISSN (Online): 1875-5550

A Sampling-Based Method for Ranking Protein Structural Models by Integrating Multiple Scores and Features

Author(s): Xiaohu Shi, Jingfen Zhang, Zhiquan He, Yi Shang and Dong Xu

Volume 12, Issue 6, 2011

Page: [540 - 548] Pages: 9

DOI: 10.2174/138920311796957658

Abstract

One of the major challenges in protein tertiary structure prediction is structure quality assessment. In many cases, protein structure prediction tools generate good structural models but fail to select the best ones from a huge number of candidates as the final output. In this study, we developed a sampling-based machine-learning method that ranks protein structural models by integrating multiple scores and features. First, features such as predicted secondary structure, solvent accessibility, and residue-residue contact information are integrated by two Radial Basis Function (RBF) models trained on different datasets. Then, the two RBF scores and five selected scoring functions developed by others (Opus-CA, Opus-PSP, DFIRE, RAPDF, and Cheng Score) are synthesized by a sampling method. Finally, another integrated RBF model ranks the structural models according to features of the sampling distribution. We tested the proposed method on two datasets: the CASP server prediction models for all CASP8 targets and a set of models generated by our in-house software MUFOLD. The test results show that our method outperforms every individual scoring function in both best-model selection and the overall correlation between the predicted ranking and the actual ranking by structural quality.
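
The abstract does not spell out the sampling scheme, so the following Python sketch is only one plausible reading of the idea, not the authors' implementation. It assumes that each scoring function has already been oriented so that a higher value means a better model (energy-like scores would need their sign flipped first), that scores are combined with randomly drawn positive weights, and that the "features of the sampling distribution" are simple summaries of each model's rank across samples. The function names, the weight-sampling scheme, and the three summary features are all hypothetical choices made for illustration.

```python
import random
from statistics import mean, pstdev


def zscore(values):
    """Standardize one score column so different scores are comparable."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def ranks(values):
    """Return rank positions (0 = best), treating higher values as better."""
    order = sorted(range(len(values)), key=lambda i: -values[i])
    position = [0] * len(values)
    for rank, idx in enumerate(order):
        position[idx] = rank
    return position


def sample_rank_features(score_table, n_samples=1000, seed=0):
    """Summarize each model's rank under randomly weighted score mixtures.

    score_table maps a score name to a list of per-model scores
    (assumption: higher is better for every score).
    """
    rng = random.Random(seed)
    names = sorted(score_table)
    normalized = {name: zscore(score_table[name]) for name in names}
    n_models = len(normalized[names[0]])
    sampled = [[] for _ in range(n_models)]

    for _ in range(n_samples):
        # Assumption: weights are drawn uniformly, then normalized to sum to 1.
        raw = [rng.random() + 1e-9 for _ in names]
        total = sum(raw)
        weights = [w / total for w in raw]
        combined = [
            sum(w * normalized[name][m] for w, name in zip(weights, names))
            for m in range(n_models)
        ]
        for m, r in enumerate(ranks(combined)):
            sampled[m].append(r)

    # Hypothetical distribution features that a downstream model could use.
    return [
        {
            "mean_rank": mean(r),
            "std_rank": pstdev(r),
            "top1_fraction": r.count(0) / n_samples,
        }
        for r in sampled
    ]
```

In the method described above, features of this kind would be passed to a further RBF model that produces the final ranking; that last step is omitted here because the abstract gives no details about it.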

Keywords: Protein structure prediction, Structural model quality assessment, CASP, Scoring function, Sampling, Machine learning, MUFOLD, protein function, nuclear magnetic resonance spectroscopy, X-ray crystallography


© 2024 Bentham Science Publishers