Generic placeholder image

Current Bioinformatics

Editor-in-Chief

ISSN (Print): 1574-8936
ISSN (Online): 2212-392X

Research Article

Biomarkers Identification of Hepatocellular Carcinoma Based on Multiomics Data Integration and Graph-embedded Deep Neural Network

Author(s): Chaokun Yan, Mengyuan Li, Zhihao Suo, Jun Zhang, Jianlin Wang, Ge Zhang, Wenjuan Liang and Huimin Luo*

Volume 18, Issue 6, 2023

Published on: 17 April, 2023

Page: [459 - 471] Pages: 13

DOI: 10.2174/1574893618666230227122331

Price: $65

conference banner
Abstract

Background: Hepatocellular carcinoma (HCC) is one of the malignancies with high mortality rate, and identify relevant biomarkers of HCC is helpful for early diagnosis and patient care. Though some high-dimensional omic data contains intrinsic biomedical information about HCC, how to integrate analysis them effectively and find promising biomarkers of HCC is still an important and difficult issue.

Methods: We present a novel biomarker identification approach, named GEDNN, based on multi-omic data and graph-embedded deep neural network. To achieve a more comprehensive understanding of HCC, we first collected and normalized the three following types of HCC-related data: DNA methylation, copy number variation (CNV), and gene expression. The ANOVA was adopted to filter out redundant genes. Then, we measured the connectivity between gene pairs by Pearson correlation coefficient of gene pairs, and further construct gene graph. Next, graph-embedded feedforward neural network (DFN) and back-propagation of convolutional neural network (CNN) were combined to integratively analyze the three types of omics data and achieve the importance score of gene biomarkers.

Results: Extensive experimental results showed that the biomarkers screened by the proposed method were effective in classifying and predicting HCC. Furthermore, the gene analysis further showed that the biomarkers screened by our method were strongly associated with the development of HCC.

Conclusion: In this paper, we propose the GEDNN method to assess the importance of genes for more accurate identification of cancer biomarkers, which facilitates the effective classification of cancers. The proposed method is applied to multi-omics data of HCC, including RNASeq, DNAMeth and CNV, considering the complementary information between different types of data. We construct a gene graph by Pearson correlation coefficients as additional information for DFN, thus reducing the importance score of redundant genes. In addition, the proposed method also incorporates back-propagation of CNN to further obtain the importance of features.

Next »
[1]
Singal AG, Lampertico P, Nahon P. Epidemiology and surveillance for hepatocellular carcinoma: New trends. J Hepatol 2020; 72(2): 250-61.
[http://dx.doi.org/10.1016/j.jhep.2019.08.025] [PMID: 31954490]
[2]
Tsuchiya N, Sawada Y, Endo I, Saito K, Uemura Y, Nakatsura T. Biomarkers for the early diagnosis of hepatocellular carcinoma. World J Gastroenterol 2015; 21(37): 10573-83.
[http://dx.doi.org/10.3748/wjg.v21.i37.10573] [PMID: 26457017]
[3]
Guan MC, Wang MD, Liu SY, et al. Early diagnosis and therapeutic strategies for hepatocellular carcinoma: From bench to bedside. World J Gastrointest Oncol 2021; 13(4): 197-215.
[http://dx.doi.org/10.4251/wjgo.v13.i4.197] [PMID: 33889272]
[4]
Lin E, Lane HY. Machine learning and systems genomics approaches for multi-omics data. Biomark Res 2017; 5(1): 2.
[http://dx.doi.org/10.1186/s40364-017-0082-y] [PMID: 28127429]
[5]
Zhang W, Zhang H, Yang H, Li M, Xie Z, Li W. Computational resources associating diseases with genotypes, phenotypes and exposures. Brief Bioinform 2019; 20(6): 2098-115.
[http://dx.doi.org/10.1093/bib/bby071] [PMID: 30102366]
[6]
Mahmud M, Kaiser MS, McGinnity TM, Hussain A. Deep learning in mining biological data. Cognit Comput 2021; 13(1): 1-33.
[http://dx.doi.org/10.1007/s12559-020-09773-x] [PMID: 33425045]
[7]
Wang T, Shao W, Huang Z, et al. MOGONET integrates multiomics data using graph convolutional networks allowing patient classification and biomarker identification. Nat Commun 2021; 12(1): 3445.
[http://dx.doi.org/10.1038/s41467-021-23774-w] [PMID: 34103512]
[8]
Wang C, Chen L, Yang Y, Zhang M, Wong G. Identification of potential blood biomarkers for Parkinson’s disease by gene expression and DNA methylation data integration analysis. Clin Epigenetics 2019; 11(1): 24.
[http://dx.doi.org/10.1186/s13148-019-0621-5] [PMID: 30744671]
[9]
Cheng J, Wei D, Ji Y, et al. Integrative analysis of DNA methylation and gene expression reveals hepatocellular carcinoma-specific diagnostic biomarkers. Genome Med 2018; 10(1): 42.
[http://dx.doi.org/10.1186/s13073-018-0548-z] [PMID: 29848370]
[10]
Tong D, Tian Y, Zhou T, et al. Improving prediction performance of colon cancer prognosis based on the integration of clinical and multi-omics data. BMC Med Inform Decis Mak 2020; 20(1): 22.
[http://dx.doi.org/10.1186/s12911-020-1043-1] [PMID: 32033604]
[11]
Lemsara A, Ouadfel S, Fröhlich H. PathME: pathway based multimodal sparse autoencoders for clustering of patient-level multiomics data. BMC Bioinformat 2020; 21(1): 146.
[http://dx.doi.org/10.1186/s12859-020-3465-2] [PMID: 32299344]
[12]
Emura T, Chen YH. Gene selection for survival data under dependent censoring: A copula-based approach. Stat Methods Med Res 2016; 25(6): 2840-57.
[http://dx.doi.org/10.1177/0962280214533378] [PMID: 24821000]
[13]
van de Vijver MJ, He YD, van ’t Veer LJ, et al. A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 2002; 347(25): 1999-2009.
[http://dx.doi.org/10.1056/NEJMoa021967] [PMID: 12490681]
[14]
Faris H, Mafarja MM, Heidari AA, et al. An efficient binary Salp Swarm Algorithm with crossover scheme for feature selection problems. Knowl Base Syst 2018; 154: 43-67.
[http://dx.doi.org/10.1016/j.knosys.2018.05.009]
[15]
Subanya B, Rajalaxmi RR. Feature selection using Artificial Bee Colony for cardiovascular disease classification. International Conference on Electronics and Communication Systems (ICECS). Coimbatore, India. 2014; 1-6.
[http://dx.doi.org/10.1109/ECS.2014.6892729]
[16]
Salem H, Attiya G, El-Fishawy N. Classification of human cancer diseases by gene expression profiles. Appl Soft Comput 2017; 50: 124-34.
[http://dx.doi.org/10.1016/j.asoc.2016.11.026]
[17]
Jain I, Jain VK, Jain R. Correlation feature selection based improved-binary particle swarm optimization for gene selection and cancer classification. Appl Soft Comput 2018; 62: 203-15.
[http://dx.doi.org/10.1016/j.asoc.2017.09.038]
[18]
LeCun Y, Bengio Y, Hinton G. Deep learning. Nature 2015; 521(7553): 436-44.
[19]
Jiang X, Zhao J, Qian W, Song W, Lin GN. A generative adversarial network model for disease gene prediction with RNA-seq data. IEEE Access 2020; 8: 37352-60.
[http://dx.doi.org/10.1109/ACCESS.2020.2975585]
[20]
Lim J, Bang S Y, Kim J, et al. Integrative deep learning for identifying differentially expressed (DE) biomarkers. Computat Math Methods Med 2019; 2019: 8418760.
[21]
Zhu Y, Ferri-Borgogno S, Sheng J, et al. Deep learning on imageomics data in identifying prognostic immune biomarkers for ovarian cancer 2020.
[22]
Kong Y, Yu T. A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data. Bioinformatics 2018; 34(21): 3727-37.
[http://dx.doi.org/10.1093/bioinformatics/bty429] [PMID: 29850911]
[23]
Park C, Oh I, Choi J, Ko S, Ahn J. Improved prediction of cancer outcome using graph-embedded generative adversarial networks. IEEE Access 2021; 9: 20076-88.
[http://dx.doi.org/10.1109/ACCESS.2021.3054894]
[24]
Kong Y, Yu T. forgeNet: A graph deep neural network model using tree-based ensemble classifiers for feature graph construction. Bioinformatics 2020; 36(11): 3507-15.
[http://dx.doi.org/10.1093/bioinformatics/btaa164] [PMID: 32163118]
[25]
Ang JC, Haron H, Hamed HNA. Semi-supervised svm-based feature selection for cancer classifification using microarray gene expression data. International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems. Seoul, South Korea 2015; pp. 468-77.
[26]
Xu W, Xu M, Wang L, et al. Integrative analysis of DNA methylation and gene expression identified cervical cancer-specific diagnostic biomarkers. Signal Transduct Target Ther 2019; 4(1): 55.
[http://dx.doi.org/10.1038/s41392-019-0081-6] [PMID: 31871774]
[27]
Sedgwick P. Pearson’s correlation coefficient. BMJ 2012; 345: e4483.
[28]
Fatima N, Rueda L. iSOM-GSN: An integrative approach for transforming multi-omic data into gene similarity networks via selforganizing maps. Bioinformatics 2020; 36(15): 4248-54.
[http://dx.doi.org/10.1093/bioinformatics/btaa500] [PMID: 32407457]
[29]
Springenberg JT, Dosovitskiy A, Brox T, et al. Striving for simplicity: The all convolutional net. ArXiv 2014: 14126806.
[30]
Park C, Ha J, Park S. Prediction of Alzheimer’s disease based on deep neural network by integrating gene expression and DNA methylation dataset. Expert Syst Appl 2020; 140: 112873.
[http://dx.doi.org/10.1016/j.eswa.2019.112873]
[31]
Kim M, Oh I, Ahn J. An improved method for prediction of cancer prognosis by network learning. Genes (Basel) 2018; 9(10): 478.
[http://dx.doi.org/10.3390/genes9100478] [PMID: 30279327]
[32]
Choi J, Oh I, Seo S, Ahn J. G2Vec: Distributed gene representations for identification of cancer prognostic genes. Sci Rep 2018; 8(1): 13729.
[http://dx.doi.org/10.1038/s41598-018-32180-0] [PMID: 30213980]
[33]
Lin Y, Liang R, Ye J, et al. A twenty gene-based gene set variation score reflects the pathological progression from cirrhosis to hepatocellular carcinoma. Aging 2019; 11(23): 11157-69.
[http://dx.doi.org/10.18632/aging.102518] [PMID: 31811111]
[34]
Yang TB, Yi F, Liu WF, Yang YH, Yang C, Sun J. Identification of hsa_circ_0039053 as an up-regulated and oncogenic circRNA in hepatocellular carcinoma via the miR-637-mediated USP21 activation. J Cancer 2020; 11(23): 6950-9.
[http://dx.doi.org/10.7150/jca.48998] [PMID: 33123285]
[35]
Li W, Cui K, Prochownik EV, Li Y. The deubiquitinase USP21 stabilizes MEK2 to promote tumor growth. Cell Death Dis 2018; 9(5): 482.
[http://dx.doi.org/10.1038/s41419-018-0523-z] [PMID: 29706623]
[36]
Yu J, Huang W, Xu Q, et al. Overactivated neddylation pathway in human hepatocellular carcinoma. Cancer Med 2018; 7(7): 3363-72.
[http://dx.doi.org/10.1002/cam4.1578] [PMID: 29846044]
[37]
Enomoto H, Nakamura H, Liu W, et al. Down-regulation of HDGF inhibits the growth of hepatocellular carcinoma cells in vitro and in vivo. Anticancer Res 2015; 35(12): 6475-9.
[PMID: 26637859]
[38]
Yang G, Zhang A, Wang J, et al. Hepatoma-derived growth factor promotes growth and metastasis of hepatocellular carcinoma cells. Cell Biochem Funct 2016; 34(4): 274-85.
[http://dx.doi.org/10.1002/cbf.3189] [PMID: 27273265]
[39]
Enomoto H, Nakamura H, Nishikawa H, et al. Hepatocellular carcinoma-associated microRNAs induced by hepatoma-derived growth factor stimulation. In vivo 2020; 34(5): 2297-301.
[40]
Wei Y, Chen X, Liang C, et al. A noncoding regulatory RNAs network driven by CircCDYL acts specifically in the early stages hepatocellular carcinoma. Hepatology 2020; 71(1): 130-47.
[http://dx.doi.org/10.1002/hep.30795] [PMID: 31148183]
[41]
Zuo X, Chen Z, Gao W, et al. M6A-mediated upregulation of LINC00958 increases lipogenesis and acts as a nanotherapeutic target in hepatocellular carcinoma. J Hematol Oncol 2020; 13(1): 5.
[http://dx.doi.org/10.1186/s13045-019-0839-x] [PMID: 31915027]
[42]
Li H, Lan T, Xu L, et al. NCSTN promotes hepatocellular carcinoma cell growth and metastasis via β -catenin activation in a Notch1/AKT dependent manner. J Exp Clin Cancer Res 2020; 39(1): 128.
[http://dx.doi.org/10.1186/s13046-020-01638-3] [PMID: 32631394]
[43]
Wang X, Wang X, Xu Y, et al. Effect of nicastrin on hepatocellular carcinoma proliferation and apoptosis through PI3K/AKT signalling pathway modulation. Cancer Cell Int 2020; 20(1): 91.
[http://dx.doi.org/10.1186/s12935-020-01172-4] [PMID: 32226312]
[44]
Song Q, Zhu X, Jin L, Chen M, Zhang W, Su J. SMGR: A joint statistical method for integrative analysis of single-cell multi-omics data. NAR Genom Bioinform 2022; 4(3): lqac056.
[http://dx.doi.org/10.1093/nargab/lqac056] [PMID: 35910046]
[45]
Song Q, Wang H, Bao J, et al. Systems biology approach to studying proliferation-dependent prognostic subnetworks in breast cancer. Sci Rep 2015; 5(1): 12981.
[http://dx.doi.org/10.1038/srep12981] [PMID: 26257336]

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy