Abstract
Bioinformatics research in the last three decades has contributed quite a large number of methodologies for compressing genomic sequence data. However, recent progress in the next generation sequencing (NGS) techniques requires the development of more effective compression methods. In this review paper, a comprehensive overview of the state-of-the-art DNA sequence compression techniques for handling the exponential growth of DNA sequence data, emerging from NGS techniques, is provided.
Keywords: Cloud computing, next generation sequencing, sequence compression, DNA sequence data, NGS techniques, Arabidopsis thaliana, compression ratio, single nucleotide polymorphisms, genetic disease, genome