Abstract
Based on the classification of the 20 amino acids, we reduce a protein primary sequence to six (0,1) sequences. For each (0,1) sequence, we calculate two so-called normalized relative-entropies, which together yield a 12-dimensional vector describing the protein primary sequence. An examination of the similarities and dissimilarities among eight different proteins illustrates the utility of the approach.
Keywords: Protein, (0,1)-sequence, normalized relative-entropy