Abstract
With the high speed DNA sequencing of genome, databases of genome data continue to grow, and the understanding of genetic variation between individuals grows as well. Single nucleotide polymorphisms (SNPs), the most common type of genetic variation, are an increasingly important resource for understanding the structure and function of the human genome and become a valuable resource for investigating the genetic basis of disease. During the past years, in addition to experimental approaches to characterize specific variants, intense bioinformatics techniques were applied to understand the effects of these genetic changes. In the genetics studies, one intends to understand the molecular basis of disease, and computational methods are becoming increasingly important for SNPs selection, prediction and understanding the downstream effects of genetic variation. The review provides systematic information on the available resources and methods for SNPs discovery and analysis. We also report some new results on DNA sequence-based prediction of SNPs in human cytochrome P450, which serves as an example of computational methods to predict and discover SNPs. Additionally, annotation and prediction of functional SNPs, as well as a comprehensive list of existing tools and online recourses, are reviewed and described.
Keywords: Single nucleotide polymorphism, SNPs prediction, SNPs discovery, SNPs annotation, functional analysis.