Abstract
The determination of a proteins biochemical function from its 3D structure has proved more difficult than anticipated for structural genomics proteins, most of which are of unknown or uncertain function. Functional annotations typically have been assigned using the closest sequence or structure match, a practice that has resulted in large numbers of misannotated proteins. Recently it was reported that computed protonation properties can be used to predict the residues with catalytic and binding activity, thus providing clues about the function of the protein. We show that residues with anomalous computed protonation behavior constitute a small fraction of the proteins highly conserved residues. Results for a test set of 61 proteins reveal that the average conservation scores are high for residues with unusual protonation behavior, even for many not annotated as functionally important in the literature. Two enzymes, protein tyrosine phosphatase from Yersinia enterocolitica and glucosamine-6-phosphate deaminase from Escherichia coli, are described in detail as examples to illustrate the relationship between anomalous protonation behavior and conservation. We conclude that the residues with anomalous protonation behavior are generally highly conserved, but are fewer in number and more spatially localized than the set of all highly conserved residues in a given protein.
Keywords: Protonation states, functional residues, sequence conservation, function annotation, THEMATICS