Abstract
The representation system for protein conformation has a crucial effect on the speed of various protein-related simulations, including ab initio protein structure prediction and protein-protein docking simulation. Usually, the finer a representation system, the longer is the computational time required to employ the representation system in simulations. On the other hand, very coarse lattice systems cannot be directly applied to the simulation problems with real proteins. We report a new, fragment library-based protein conformation representation system, prepared by clustering amino acid conformations from 154 proteins. This system was composed of 64 most representative fragments per each amino acid, and based on the unified residue approach in which two spheres per amino acid were used. It could represent the conformation of the 82 proteins in an independent test set with the mean and standard deviation RMSD of 1.01 and 0.09 Å, respectively, based on the position of alpha carbons and the centers of mass of sidechains.
Keywords: Protein conformation, Representation system, Fragment library, Self-organizing map, Clustering