Abstract
Based on the primary sequence, by selecting the pseudo amino acid composition, position weight matrix score, the predicted secondary structure and the second neighbor dipeptide composition as characteristic parameters, an approach of diversity increment for predicting 27-class protein folds is proposed. Overall recognition accuracy reaches 61.10% in the independent testing.
Keywords: Increment of diversity, pseudo amino acid composition, position weight matrix, protein fold