Abstract
Background: Identifying differentially methylated region (DMR) is a basic but important task in epigenomics, which can help investigate the mechanisms of diseases and provide methylation biomarkers for screening diseases. A set of methods have been proposed to identify DMRs from methylation array data. However, it lacks effective metrics to characterize different DMR sets and enable a straight way for comparison.
Methods: In this study, we introduce a metric, DMRn, to characterize DMR sets detected by different methods from methylation array data. To calculate DMRn, firstly, the methylation differences of DMRs are recalculated by incorporating the correlations between probes and their represented CpGs. Then, DMRn is calculated based on the number of probes and the dense of CpGs in DMRs with methylation differences falling in each interval.
Result & Discussion: By comparing the DMRn of DMR sets predicted by seven methods on four scenario, the results demonstrate that DMRn can make an efficient guidance for selecting DMR sets, and provide new insights in cancer genomics studies by comparing the DMR sets from the related pathological states. For example, there are many regions with subtle methylation alteration in subtypes of prostate cancer are altered oppositely in the benign state, which may indicate a possible revision mechanism in benign prostate cancer.
Conclusion: Futhermore, when applied to datasets that underwent different runs of batch effect removal, the DMRn can help to visualize the bias introduced by multi-runs of batch effect removal. The tool for calculating DMRn is available in the GitHub repository(https://github.com/xqpeng/DMRArrayMetric).
[http://dx.doi.org/10.1038/srep08257]
[http://dx.doi.org/10.4161/epi.5.6.12228] [PMID: 20505344]
[http://dx.doi.org/10.1093/bfgp/elw017] [PMID: 27416614]
[http://dx.doi.org/10.1016/j.csbj.2021.08.014] [PMID: 34471503]
[http://dx.doi.org/10.1093/bioinformatics/btx622] [PMID: 29028927]
[http://dx.doi.org/10.1007/s00401-019-01966-5] [PMID: 30712078]
[http://dx.doi.org/10.1371/journal.pone.0002698] [PMID: 18628954]
[http://dx.doi.org/10.1002/pros.23093] [PMID: 26383847]
[http://dx.doi.org/10.1093/nar/gkab957] [PMID: 34669946]
[http://dx.doi.org/10.1109/TCSS.2022.3216483]
[http://dx.doi.org/10.1109/TCBB.2016.2635144]
[http://dx.doi.org/10.1049/cje.2021.06.002]
[http://dx.doi.org/10.3389/fgene.2021.697279] [PMID: 34262601]
[http://dx.doi.org/10.1093/bioinformatics/btaa732] [PMID: 32805005]
[http://dx.doi.org/10.1186/1756-8935-8-6] [PMID: 25972926]
[http://dx.doi.org/10.1093/bioinformatics/btt498] [PMID: 23990415]
[http://dx.doi.org/10.1093/bioinformatics/btx467] [PMID: 29036320]
[http://dx.doi.org/10.1093/bioinformatics/bts545] [PMID: 22954632]
[http://dx.doi.org/10.1016/j.ymeth.2014.10.036] [PMID: 25461817]
[http://dx.doi.org/10.2174/1574893615999200724145835]
[http://dx.doi.org/10.1093/bioinformatics/btu049] [PMID: 24478339]
[http://dx.doi.org/10.1186/s13059-019-1664-9] [PMID: 30871603]
[http://dx.doi.org/10.1093/nar/gkt242] [PMID: 23598999]
[http://dx.doi.org/10.1093/nar/gkz590] [PMID: 31291459]
[http://dx.doi.org/10.1093/bioinformatics/bts013] [PMID: 22253290]
[http://dx.doi.org/10.1093/bioinformatics/btz096] [PMID: 30753302]
[http://dx.doi.org/10.1093/ije/dyr238] [PMID: 22422453]
[http://dx.doi.org/10.1093/nar/gkr053] [PMID: 21306990]
[http://dx.doi.org/10.1093/bioinformatics/btw304] [PMID: 27187204]
[http://dx.doi.org/10.2174/1574893617666220404145517]
[http://dx.doi.org/10.1007/s11704-020-0180-0]
[http://dx.doi.org/10.1093/bib/bby085] [PMID: 30239597]
[http://dx.doi.org/10.1186/s12859-015-0641-x] [PMID: 26156501]
[http://dx.doi.org/10.1093/bib/bbaa060] [PMID: 32427285]
[http://dx.doi.org/10.1186/s12916-021-02109-y] [PMID: 34641873]
[http://dx.doi.org/10.1073/pnas.1703577114] [PMID: 28652331]
[http://dx.doi.org/10.1186/s12885-019-5403-0] [PMID: 30866861]
[http://dx.doi.org/10.1093/bib/bbab475] [PMID: 34874989]
[http://dx.doi.org/10.3389/fonc.2018.00100] [PMID: 29740534]
[http://dx.doi.org/10.1186/s13148-021-01155-w] [PMID: 34454584]
[http://dx.doi.org/10.1186/1756-8935-6-26] [PMID: 23919675]
[http://dx.doi.org/10.1186/gb-2014-15-4-r54] [PMID: 24690455]
[http://dx.doi.org/10.2217/epi.16.8] [PMID: 27004446]
[http://dx.doi.org/10.1093/bioinformatics/btx513] [PMID: 28961746]
[http://dx.doi.org/10.1093/bioinformatics/bts034] [PMID: 22257669]
[http://dx.doi.org/10.1038/s41467-020-20225-w] [PMID: 33339831]
[http://dx.doi.org/10.1038/nmeth.2238] [PMID: 23281567]