Abstract
A novel numerical characterization for graphical representations of DNA sequences is proposed, which consists of two steps: construction of a novel mathematical object that only takes the first row of the traditional mathematical object and then analysis of the similarities/dissimilarities using the distance between the objects of DNA sequences. Our novel mathematical object is only a row vector and calculation of the matrix invariant is avoided, it can significantly lessen the computation. We demonstrate that this novel method can extract enough information of DNA sequences by analyzing the probabilities of six cases. The contrast experiments show that our method has similar results to the traditional methods.
Keywords: Graphical representations of DNA sequences, mathematical object, matrix invariant, sequence comparison, similarity/dissimilarity analysis of DNA sequences.
Graphical Abstract