Abstract
The very rapid growth in molecular sequence data from the daily accretion of large gene and protein sequencing projects have led to issues regarding viewing and analyzing the massive amounts of data. Graphical representation and numerical characterization of DNA, RNA and protein sequences have exhibited great potential to address these concerns. We review here in brief several different formulations of these representations and examples of applications to diverse problems based on what this author had presented at the Second Mathematical Chemistry Workshop of the Americas in Bogota, Colombia in 2010. In particular, we note several insights that were gained from such representations, and the applications to the bio-medicinal field.
Keywords: Alignment free sequence comparison, bio-medical applications, chaos generator representation, DNA graphical representation, DNA numerical characterization, DNA sequences global characteristics, DNA visualization, GRANCH applications, GRANCH techniques, identifying potential drug target regions in viruses, lattice graphs in medical applications, long range correlations, long range palindromic sequences, phylogenetic trees, protein graphical representation, protein numerical characterization, purine pyrimidine relationship, QSAR.