Novel DNA Sequence Comparison Method Based on Markov Chain and Information Entropy

Zhao-Hui Qi; Meng-Zhe Jin; Jia-Shuo Wang; Su-Li Li

doi:10.2174/1570193X13666151218191633

Abstract

The comparison of DNA sequences is a basic topic in computational biology and bioinformatics; it helps researchers infer previously ambiguous structure, function, and evolutionary relationships. In this article, we provide a novel DNA sequence comparison scheme that constructs feature vectors based on a Markov chain and information entropy. A new measure, calculated as the entropy of a K-string’s four one-step transition probabilities, is used to compose the feature vector that characterizes a DNA sequence. We also introduce a novel concept, the K-string list, to address the computational burden caused by the exponential growth of computational complexity as K grows in a traditional K-string model. The proposed scheme allows us to conduct similarity research and phylogenetic analysis on two real datasets, the first exon of 11 species’

Keywords: DNA sequence comparison, entropy, feature vector, K-string list, Markov model, phylogenetic analysis.
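
The measure described in the abstract, the entropy of a K-string's four one-step transition probabilities, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how the K-string list is constructed or how feature vectors are compared, so the function names, the caller-supplied `kstring_list`, the use of base-2 logarithms, and the choice of 0.0 for K-strings that never occur are all assumptions made for illustration.

```python
from collections import Counter, defaultdict
from math import log2

ALPHABET = "ACGT"


def transition_entropies(seq, k):
    """Map each K-string observed in seq to the Shannon entropy (in bits)
    of its one-step transition distribution over A, C, G, T."""
    seq = seq.upper()
    counts = defaultdict(Counter)
    # Count which nucleotide follows each occurrence of each K-string.
    for i in range(len(seq) - k):
        kmer, nxt = seq[i:i + k], seq[i + k]
        # Skip windows containing ambiguous symbols such as N.
        if nxt in ALPHABET and all(c in ALPHABET for c in kmer):
            counts[kmer][nxt] += 1
    entropies = {}
    for kmer, nxt_counts in counts.items():
        total = sum(nxt_counts.values())
        # Transitions that never occur contribute nothing to the sum.
        entropies[kmer] = sum(
            (n / total) * log2(total / n) for n in nxt_counts.values()
        )
    return entropies


def feature_vector(seq, k, kstring_list):
    """Entropy values ordered by a caller-supplied list of K-strings.

    kstring_list is a hypothetical stand-in for the paper's K-string list;
    K-strings with no observed transition get 0.0 (an assumption).
    """
    ent = transition_entropies(seq, k)
    return [ent.get(s, 0.0) for s in kstring_list]


if __name__ == "__main__":
    print(feature_vector("ACACAGAT", 1, ["A", "C", "G", "T"]))
```

In the toy sequence `ACACAGAT` with K = 1, `A` is followed by `C` twice and by `G` and `T` once each, so its transition distribution is (0.5, 0.25, 0.25) and its entropy is 1.5 bits, while `C` and `G` are always followed by `A` and have zero entropy. Restricting the vector to a chosen list of K-strings, rather than enumerating all 4^K of them, keeps its length independent of K in this sketch.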

DOI: https://dx.doi.org/10.2174/1570193X13666151218191633
Print ISSN: 1570-193X
Online ISSN: 1875-6298
Publisher Name: Bentham Science Publisher

Mini-Reviews in Organic Chemistry
