Abstract
Continuing our effort to predict mutations in proteins from influenza A virus, this study asked whether distinguishing between arginine, leucine and serine improves predictability, because these residues are governed by different probabilistic mechanisms in the translation of RNA codons to amino acids. We based the prediction on the mutation relations among 299 H5N1 hemagglutinins of influenza A virus, and then compared the results obtained with and without distinguishing between arginine, leucine and serine. The prediction that distinguishes between these residues performs better than the prediction that does not.
Keywords: Amino acid, logistic regression, hemagglutinin, influenza, modelling, mutation, prediction, RNA, virus protein
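The premise that arginine, leucine and serine differ from the other residues in codon-to-amino-acid translation can be illustrated with the standard genetic code: these are the only three amino acids encoded by six codons, and in each case the six codons fall into a four-codon block and a two-codon block. The sketch below is only an illustration under that reading. The function names are hypothetical, and grouping codons by their first two bases is an assumption about how the residues might be distinguished, not a description of the method used in this study.

```python
from collections import defaultdict
from itertools import product

# Standard genetic code, written in the conventional U, C, A, G ordering.
# "*" marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map each RNA codon (e.g. "CGU") to its one-letter amino acid code.
CODON_TABLE = {
    "".join(triplet): aa
    for triplet, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codons_for(residue):
    """Return all codons that translate to the given one-letter residue."""
    return [codon for codon, aa in CODON_TABLE.items() if aa == residue]


def six_codon_residues():
    """Return the amino acids encoded by exactly six codons, sorted."""
    residues = set(CODON_TABLE.values()) - {"*"}
    return sorted(aa for aa in residues if len(codons_for(aa)) == 6)


def split_by_family(residue):
    """Group a residue's codons by their first two bases (assumed grouping)."""
    families = defaultdict(list)
    for codon in codons_for(residue):
        families[codon[:2]].append(codon)
    return dict(families)


if __name__ == "__main__":
    for residue in six_codon_residues():
        print(residue, split_by_family(residue))
```

Running the script lists the codon blocks for L, R and S. A point mutation in a two-codon block has a different set of reachable amino acids than one in the four-codon block, which is one way to see why treating each of these residues as a single category could blur the mutation probabilities.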