Generic placeholder image

Current Bioinformatics

Editor-in-Chief

ISSN (Print): 1574-8936
ISSN (Online): 2212-392X

Research Article

SpBLRSR: Schatten p-norm Constrained Bounded Low-rank Subspace Recovery for Predicting N7-methylguanosine (m7G)-disease Associations

Author(s): Jiani Ma, Lin Zhang, Xiangzhi Chen and Hui Liu*

Volume 17, Issue 7, 2022

Published on: 27 August, 2022

Page: [657 - 668] Pages: 12

DOI: 10.2174/1574893617666220617122848

Price: $65

Abstract

Background: As an essential positively charged RNA modification, N7-methylguanosine (m7G) has been reported to be associated with multiple diseases including cancers. While transcriptomewide m7G sites have been identified by high-throughput sequencing approaches, the disease-associated m7G sites are still largely unknown. Therefore, computational methods are urgently needed to predict potential m7G-disease associations, which is crucial for understanding the biosynthetic pathways of tumorigenesis at the epi-transcriptome layer.

Objective: We hope to develop an effective computational method that can accurately predict the associations between m7G sites and diseases, and then to prioritizing candidate m7G sites for novel diseases.

Methods: In this article, we proposed a Schatten p-norm constrained bounded low-rank subspace recovery (SpBLRSR) method for m7G-disease association prediction. An m7G-disease block matrix was built to alleviate the sparseness during the association pattern discovery process. By incorporating the lowrank representation (LRR) model and sparse subspace clustering (SSC) model, SpBLRSR was designed to capture both the global and local structures of the association pattern.

Results Compared with the benchmark methods, SpBLRSR achieved the best performance in predicting associations between m7G sites and disease, and in prioritizing m7G sites for novel diseases. Then the robustness of Schatten p-norm in our method was further validated via a noise contamination experiment. Finally, a case study of breast cancer was performed to elucidate the biological meaning of our method.

Conclusion: SpBLRSR exploits the disease pathogenesis at the epitranscriptome layer by predicting potential m7A sites for disease.

Keywords: m7G-disease association prediction, low-rank subspace recovery, matrix completion, Schatten p-norm

« Previous
Graphical Abstract

[1]
Boccaletto P, Machnicka MA, Purta E, et al. MODOMICS: A database of RNA modification pathways. 2017 update. Nucleic Acids Res 2018; 46(D1): D303-7.
[http://dx.doi.org/10.1093/nar/gkx1030] [PMID: 29106616]
[2]
Cowling VH. Regulation of mRNA cap methylation. Biochem J 2009; 425(2): 295-302.
[http://dx.doi.org/10.1042/BJ20091352] [PMID: 20025612]
[3]
Malbec L, Zhang T, Chen YS, et al. Dynamic methylome of internal mRNA N7-methylguanosine and its regulatory role in translation. Cell Res 2019; 29(11): 927-41.
[http://dx.doi.org/10.1038/s41422-019-0230-z] [PMID: 31520064]
[4]
Guy MP, Phizicky EM. Two-subunit enzymes involved in eukaryotic post-transcriptional tRNA modification. RNA Biol 2014; 11(12): 1608-18.
[http://dx.doi.org/10.1080/15476286.2015.1008360] [PMID: 25625329]
[5]
Sloan KE, Warda AS, Sharma S, Entian KD, Lafontaine DLJ, Bohnsack MT. Tuning the ribosome: The influence of rRNA modification on eukaryotic ribosome biogenesis and function. RNA Biol 2017; 14(9): 1138-52.
[http://dx.doi.org/10.1080/15476286.2016.1259781] [PMID: 27911188]
[6]
Shaheen R, Abdel-Salam GM, Guy MP, et al. Mutation in WDR4 impairs tRNA m(7)G46 methylation and causes a distinct form of micro-cephalic primordial dwarfism. Genome Biol 2015; 16(1): 210.
[http://dx.doi.org/10.1186/s13059-015-0779-x] [PMID: 26416026]
[7]
Lin S, Liu Q, Lelyveld VS, Choe J, Szostak JW, Gregory RI. Mettl1/Wdr4-Mediated m7G tRNA methylome is required for normal mRNA translation and embryonic stem cell self-renewal and differentiation. Mol Cell 2018; 71(2): 244-255.e5.
[http://dx.doi.org/10.1016/j.molcel.2018.06.001] [PMID: 29983320]
[8]
Deng Y, Zhou Z, Ji W, Lin S, Wang M. METTL1-mediated m7G methylation maintains pluripotency in human stem cells and limits mes-oderm differentiation and vascular development. Stem Cell Res Ther 2020; 11(1): 306.
[http://dx.doi.org/10.1186/s13287-020-01814-4] [PMID: 32698871]
[9]
Zhang LS, Liu C, Ma H, et al. Transcriptome-wide mapping of internal N7-methylguanosine methylome in mammalian mRNA. Mol Cell 2019; 74(6): 1304-1316.e8.
[http://dx.doi.org/10.1016/j.molcel.2019.03.036] [PMID: 31031084]
[10]
Song B, Tang Y, Chen K, et al. m7GHub: Deciphering the location, regulation and pathogenesis of internal mRNA N7-methylguanosine (m7G) sites in human. Bioinf 2020; 36(11): 3528-36.
[http://dx.doi.org/10.1093/bioinformatics/btaa178] [PMID: 32163126]
[11]
Chen K, Song B, Tang Y, et al. RMDisease: A database of genetic variants that affect RNA modifications, with implications for epitran-scriptome pathogenesis. Nucleic Acids Res 2021; 49(D1): D1396-404.
[http://dx.doi.org/10.1093/nar/gkaa790] [PMID: 33010174]
[12]
Zhou Y, Kong Y, Fan W, et al. Principles of RNA methylation and their implications for biology and medicine. Biomed Pharmacother 2020; 131: 110731.
[http://dx.doi.org/10.1016/j.biopha.2020.110731] [PMID: 32920520]
[13]
Ma J, Zhang L, Chen J, Song B, Zang C, Liu H. m7GDisAI: N7-methylguanosine (m7G) sites and diseases associations inference based on heterogeneous network. BMC Bioinformatics 2021; 22(1): 152.
[http://dx.doi.org/10.1186/s12859-021-04007-9] [PMID: 33761868]
[14]
Candás E, Ma Y, Wright J. Robust principal component analysis? J Assoc Comput Mach 2011; 58(3): 1-37.
[http://dx.doi.org/10.1145/1970392.1970395]
[15]
Liu G, Lin Z, Yan S, Sun J, Yu Y, Ma Y. Robust recovery of subspace structures by low-rank representation. IEEE Trans Pattern Anal Mach Intell 2013; 35(1): 171-84.
[http://dx.doi.org/10.1109/TPAMI.2012.88] [PMID: 22487984]
[16]
Elhamifar E, Vidal R. Sparse subspace clustering: Algorithm, theory, and applications. IEEE Trans Pattern Anal Mach Intell 2013; 35(11): 2765-81.
[http://dx.doi.org/10.1109/TPAMI.2013.57] [PMID: 24051734]
[17]
Liu L, Huang W, Chen D. Exact minimum rank approximation via Schatten p-norm minimization. J Comput Appl Math 2014; 267: 218-27.
[http://dx.doi.org/10.1016/j.cam.2014.02.015]
[18]
Lu C, Tang J, Yan S, Lin Z. Generalized nonconvex nonsmooth low-rank minimization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014 Jun 23-28; USA, Columbus.
[19]
Cai J, Candes E, Shen Z. A singular value thresholding algorithm for matrix computation. SIAM J Optim 2010; 20(4): 1956-82.
[http://dx.doi.org/10.1137/080738970]
[20]
Nie F, Ding C, Ding C. Low-rank matrix recovery via efficient schatten p-norm minimization. Proceedings of the 26th AAAI Conference on Artificial Intelligence. 2012 Jun 22-26; Canada, Toronto, California.
[21]
Zhang X, Xu C, Sun X, Baciu G. Schatten-q regularizer constrained low rank subspace clustering model. Neurocomputing 2016; 182: 36-47.
[http://dx.doi.org/10.1016/j.neucom.2015.12.009]
[22]
Zhang H, Yang J, Shang F, Gong C, Zhang Z. LRR for subspace segmentation via tractable schatten-$p$ norm minimization and factoriza-tion. IEEE Trans Cybern 2018; 49(5): 1722-34.
[http://dx.doi.org/10.1109/TCYB.2018.2811764] [PMID: 29993878]
[23]
Hu Y, Zhao L, Liu Z, et al. DisSetSim: An online system for calculating similarity between disease sets. J Biomed Semantics 2017; 8(S1) (Suppl. 1): 28.
[http://dx.doi.org/10.1186/s13326-017-0140-2] [PMID: 29297411]
[24]
Candes E, Wakin M, Boyd S. Enhancing sparsity by reweighted l(1) minimization. J Fourier Anal Appl 2008; 14(5): 877-905.
[http://dx.doi.org/10.1007/s00041-008-9045-x]
[25]
Zhang H, Yang J, Qian J, Luo W. Nonconvex relaxation based matrix regression for face recognition with structural noise and mixed noise. Neurocomputing 2017; 269: 188-98.
[http://dx.doi.org/10.1016/j.neucom.2016.12.095]
[26]
Lin Z, Chen M, Ma Y. The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices arXiv preprint arXiv:10095055 2010.
[27]
Zuo W, Meng D, Zhang L, Feng X, Zhang D. A generalized iterated shrinkage algorithm for non-convex sparse coding. Proceedings of 2013 IEEE International Conference on Computer Vision. 2013 Dec 1-8; Australia, Sydney.
[http://dx.doi.org/10.1109/ICCV.2013.34]
[28]
Fawcett T. An introduction to ROC analysis. Pattern Recognit Lett 2006; 27(8): 861-74.
[http://dx.doi.org/10.1016/j.patrec.2005.10.010]
[29]
Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS One 2015; 10(3): e0118432.
[http://dx.doi.org/10.1371/journal.pone.0118432] [PMID: 25738806]
[30]
Keilwagen J, Grosse I, Grau J. Area under precision-recall curves for weighted and unweighted data. PLoS One 2014; 9(3): e92209.
[http://dx.doi.org/10.1371/journal.pone.0092209] [PMID: 24651729]
[31]
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2018; 68(6): 394-424.
[http://dx.doi.org/10.3322/caac.21492] [PMID: 30207593]
[32]
Okuda S, Yamada T, Hamajima M, et al. KEGG Atlas mapping for global analysis of metabolic pathways. Nucleic Acids Res 2008; 36(Web Server issue): W423-6.
[http://dx.doi.org/10.1093/nar/gkn282] [PMID: 18477636]
[33]
Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: New perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res 2017; 45(D1): D353-61.
[http://dx.doi.org/10.1093/nar/gkw1092] [PMID: 27899662]
[34]
Dickinson DJ, Nelson WJ, Weis WI. A polarized epithelium organized by beta and alpha catenin predates cadherin and metazoan origins. Science 2011; 331(6022): 1336-9.
[http://dx.doi.org/10.1126/science.1199633] [PMID: 21393547]
[35]
Bazzoun D, Lelièvre S, Talhouk R. Polarity proteins as regulators of cell junction complexes: implications for breast cancer. Pharmacol Ther 2013; 138(3): 418-27.
[http://dx.doi.org/10.1016/j.pharmthera.2013.02.004] [PMID: 23458609]
[36]
Akhurst RJ, Balmain A. Genetic events and the role of TGF beta in epithelial tumour progression. J Pathol 1999; 187(1): 82-90.
[http://dx.doi.org/10.1002/(SICI)1096-9896(199901)187:1<82:AID-PATH248>3.0.CO;2-8] [PMID: 10341708]
[37]
Scollen S, Luccarini C, Baynes C, et al. TGF-β signaling pathway and breast cancer susceptibility. Cancer Epidemiol Biomarkers Prev 2011; 20(6): 1112-9.
[http://dx.doi.org/10.1158/1055-9965.EPI-11-0062] [PMID: 21527583]
[38]
Hennessy BT, Gonzalez-Angulo AM, Stemke-Hale K, et al. Characterization of a naturally occurring breast cancer subset enriched in epi-thelial-to-mesenchymal transition and stem cell characteristics. Cancer Res 2009; 69(10): 4116-24.
[http://dx.doi.org/10.1158/0008-5472.CAN-08-3441] [PMID: 19435916]
[39]
Harvey KF, Zhang X, Thomas DM. The Hippo pathway and human cancer. Nat Rev Cancer 2013; 13(4): 246-57.
[http://dx.doi.org/10.1038/nrc3458] [PMID: 23467301]
[40]
Sjöblom T, Jones S, Wood LD, et al. The consensus coding sequences of human breast and colorectal cancers. Science 2006; 314(5797): 268-74.
[http://dx.doi.org/10.1126/science.1133427] [PMID: 16959974]
[41]
Wood LD, Parsons DW, Jones S, et al. The genomic landscapes of human breast and colorectal cancers. Science 2007; 318(5853): 1108-13.
[http://dx.doi.org/10.1126/science.1145720] [PMID: 17932254]
[42]
Polak P, Kim J, Braunstein LZ, et al. A mutational signature reveals alterations underlying deficient homologous recombination repair in breast cancer. Nat Genet 2017; 49(10): 1476-86.
[http://dx.doi.org/10.1038/ng.3934] [PMID: 28825726]
[43]
Malik SS, Masood N, Asif M, Ahmed P, Shah ZU, Khan JS. Expressional analysis of MLH1 and MSH2 in breast cancer. Curr Probl Cancer 2019; 43(2): 97-105.
[http://dx.doi.org/10.1016/j.currproblcancer.2018.08.001] [PMID: 30149959]
[44]
Al-Husseini MJ, Mohamed HH, Saad AM, et al. Risk and survival of chronic myeloid leukemia after breast cancer: A population-based study. Curr Probl Cancer 2019; 43(3): 213-21.
[http://dx.doi.org/10.1016/j.currproblcancer.2018.08.005] [PMID: 30195804]

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy