Abstract
With the advent of ultra-high-throughput sequencing technologies, ChIP-seq is becoming the main stream for the genome-wide studies of transcription factor binding sites (TFBSs) and histone modification sites. Computational analysis of ChIP-seq data is important for ChIP-seq applications. In this chapter, we first give an overview of the stateof- the-art ChIP-seq analysis tools developed for predicting ChIP-enriched genomic sites. Next, we describe the methods employed in a comprehensive analysis on Chen et al.’s ChIP-seq dataset in mouse embryonic stem cells (mESC) [1]. These methods include the prediction of transcription factor binding peaks, as well as subsequent analysis procedures such as de novo motif-finding and the discovery of transcription factor co-localization. By this, we demonstrate how the computational approaches assist to achieve novel biological discoveries from large-scale ChIP-seq dataset.
Keywords: Computational Analysis, Sequencing, Transcriptional regulation, Chromatin Immuno-precipitation, Histone modification, ChIP-seq, ChIP-enriched loci, Hidden Markov Model, Control library, Poisson distribution, mESC.