Generic placeholder image

Current Bioinformatics

Editor-in-Chief

ISSN (Print): 1574-8936
ISSN (Online): 2212-392X

Research Article

A Network-Based Method for the Detection of Cancer Driver Genes in Transcriptional Regulatory Networks Using the Structural Analysis of Weighted Regulatory Interactions

Author(s): Mostafa Akhavan-Safar, Babak Teimourpour* and Abbas Nowzari-Dalini

Volume 17, Issue 4, 2022

Published on: 21 April, 2022

Page: [327 - 343] Pages: 17

DOI: 10.2174/1574893617666220127094224

Price: $65

Abstract

Background: Identifying genes that instigate cell anomalies and cause cancer in humans is an important field in oncology research. Abnormalities in these genes are transferred to other genes in the cell, disrupting its normal functionality. Such genes are known as cancer driver genes (CDGs). Various methods have been proposed for predicting CDGs, mostly based on genomic data and computational methods. Some novel bioinformatic approaches have been developed.

Objective: In this article, we propose a network-based algorithm, SalsaDriver (Stochastic approach for link-structure analysis for driver detection), which can calculate each gene's receiving and influencing power using the stochastic analysis of regulatory interaction structures in gene regulatory networks.

Methods: First, regulatory networks related to breast, colon, and lung cancers are constructed using gene expression data and a list of regulatory interactions, the weights of which are then calculated using biological and topological features of the network. After that, the weighted regulatory interactions are used in the structural analysis of interactions, with two separate Markov chains on the bipartite graph taken from the main graph of the gene network and the implementation of the stochastic approach for link-structure analysis. The proposed algorithm categorizes higher-ranked genes as driver genes.

Results: The proposed algorithm was compared with 24 other computational and network tools based on the F-measure value and the number of detected CDGs. The results were validated using four databases. The findings of this study show that SalsaDriver outperforms other methods and can identify substantiallyy more driver genes than other methods.

Conclusion: The SalsaDriver network-based approach is suitable for predicting CDGs and can be used as a complementary method along with other computational tools.

Keywords: Driver genes, cancer, Link-structure analysis, regulatory interactions, cell anomalies, F-measure value.

Graphical Abstract

[1]
Ding L, Getz G, Wheeler DA, et al. Somatic mutations affect key pathways in lung adenocarcinoma. Nature 2008; 455(7216): 1069-75.
[http://dx.doi.org/10.1038/nature07423] [PMID: 18948947]
[2]
Mularoni L, Sabarinathan R, Deu-Pons J, Gonzalez-Perez A, López-Bigas N. OncodriveFML: a general framework to identify coding and non-coding regions with cancer driver mutations. Genome Biol 2016; 17(1): 128.
[http://dx.doi.org/10.1186/s13059-016-0994-0] [PMID: 27311963]
[3]
Sakoparnig T, Fried P, Beerenwinkel N. Identification of constrained cancer driver genes based on mutation timing. PLOS Comput Biol 2015; 11(1): e1004027.
[http://dx.doi.org/10.1371/journal.pcbi.1004027] [PMID: 25569148]
[4]
Gilissen C, Hehir-Kwa JY, Thung DT, et al. Genome sequencing identifies major causes of severe intellectual disability. Nature 2014; 511(7509): 344-7.
[http://dx.doi.org/10.1038/nature13394] [PMID: 24896178]
[5]
Youn A, Simon R. Identifying cancer driver genes in tumor genome sequencing studies. Bioinformatics 2011; 27(2): 175-81.
[http://dx.doi.org/10.1093/bioinformatics/btq630] [PMID: 21169372]
[6]
Gonzalez-Perez A, Lopez-Bigas N. Functional impact bias reveals cancer drivers. Nucleic Acids Res 2012; 40(21): e169.
[http://dx.doi.org/10.1093/nar/gks743] [PMID: 22904074]
[7]
Reimand J, Wagih O, Bader GD. The mutational landscape of phosphorylation signaling in cancer. Sci Rep 2013; 3(1): 2651.
[http://dx.doi.org/10.1038/srep02651] [PMID: 24089029]
[8]
Hua X, Xu H, Yang Y, Zhu J, Liu P, Lu Y. DrGaP: a powerful tool for identifying driver genes and pathways in cancer sequencing studies. Am J Hum Genet 2013; 93(3): 439-51.
[http://dx.doi.org/10.1016/j.ajhg.2013.07.003] [PMID: 23954162]
[9]
Aure MR, Steinfeld I, Baumbusch LO, et al. Identifying in-trans process associated genes in breast cancer by integrated analysis of copy number and expression data. PLoS One 2013; 8(1): e53014.
[http://dx.doi.org/10.1371/journal.pone.0053014] [PMID: 23382830]
[10]
Lawrence MS, Stojanov P, Polak P, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature 2013; 499(7457): 214-8.
[http://dx.doi.org/10.1038/nature12213] [PMID: 23770567]
[11]
Tamborero D, Gonzalez-Perez A, Lopez-Bigas N. OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes. Bioinformatics 2013; 29(18): 2238-44.
[http://dx.doi.org/10.1093/bioinformatics/btt395] [PMID: 23884480]
[12]
Porta-Pardo E, Godzik A. e-Driver: a novel method to identify protein regions driving cancer. Bioinformatics 2014; 30(21): 3109-14.
[http://dx.doi.org/10.1093/bioinformatics/btu499] [PMID: 25064568]
[13]
Arneson D, Bhattacharya A, Shu L, Mäkinen VP, Yang X. Mergeomics: a web server for identifying pathological pathways, networks, and key regulators via multidimensional data integration. BMC Genomics 2016; 17(1): 722.
[http://dx.doi.org/10.1186/s12864-016-3057-8] [PMID: 27612452]
[14]
Lanzós A, Carlevaro-Fita J, Mularoni L, et al. Discovery of cancer driver long noncoding RNAs across 1112 tumour genomes: new candi-dates and distinguishing features. Sci Rep 2017; 7(1): 41544.
[http://dx.doi.org/10.1038/srep41544] [PMID: 28128360]
[15]
Wang Z, Ng KS, Chen T, et al. Cancer driver mutation prediction through Bayesian integration of multi-omic data. PLoS One 2018; 13(5): e0196939.
[http://dx.doi.org/10.1371/journal.pone.0196939] [PMID: 29738578]
[16]
Vandin F, Upfal E, Raphael BJ. De novo discovery of mutated driver pathways in cancer. Genome Res 2012; 22(2): 375-85.
[http://dx.doi.org/10.1101/gr.120477.111] [PMID: 21653252]
[17]
Han Y, Yang J, Qian X, et al. DriverML: a machine learning algorithm for identifying driver genes in cancer sequencing studies. Nucleic Acids Res 2019; 47(8): e45.
[http://dx.doi.org/10.1093/nar/gkz096] [PMID: 30773592]
[18]
Cerami E, Demir E, Schultz N, Taylor BS, Sander C. Automated network analysis identifies core pathways in glioblastoma. PLoS One 2010; 5(2): e8918.
[http://dx.doi.org/10.1371/journal.pone.0008918] [PMID: 20169195]
[19]
Bashashati A, Haffari G, Ding J, et al. DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer. Genome Biol 2012; 13(12): R124.
[http://dx.doi.org/10.1186/gb-2012-13-12-r124] [PMID: 23383675]
[20]
Zhao J, Zhang S, Wu LY, Zhang XS. Efficient methods for identifying mutated driver pathways in cancer. Bioinformatics 2012; 28(22): 2940-7.
[http://dx.doi.org/10.1093/bioinformatics/bts564] [PMID: 22982574]
[21]
Ciriello G, Cerami E, Sander C, Schultz N. Mutual exclusivity analysis identifies oncogenic network modules. Genome Res 2012; 22(2): 398-406.
[http://dx.doi.org/10.1101/gr.125567.111] [PMID: 21908773]
[22]
Zhang J, Wu LY, Zhang XS, Zhang S. Discovery of co-occurring driver pathways in cancer. BMC Bioinformatics 2014; 15(1): 271.
[http://dx.doi.org/10.1186/1471-2105-15-271] [PMID: 25106096]
[23]
Hou JP, Ma J. DawnRank: discovering personalized driver genes in cancer. Genome Med 2014; 6(7): 56.
[http://dx.doi.org/10.1186/s13073-014-0056-8] [PMID: 25177370]
[24]
Guo WF, Zhang SW, Liu LL, et al. Discovering personalized driver mutation profiles of single samples in cancer by network control strat-egy. Bioinformatics 2018; 34(11): 1893-903.
[http://dx.doi.org/10.1093/bioinformatics/bty006] [PMID: 29329368]
[25]
Rahimi M, Teimourpour B, Marashi SA. Cancer driver gene discovery in transcriptional regulatory networks using influence maximization approach. Comput Biol Med 2019; 114: 103362.
[http://dx.doi.org/10.1016/j.compbiomed.2019.103362] [PMID: 31561101]
[26]
Akhavan-Safar M, Teimourpour B, Kargari M. GenHITS: A network science approach to driver gene detection in human regulatory net-work using gene’s influence evaluation. J Biomed Inform 2021; 114: 103661.
[http://dx.doi.org/10.1016/j.jbi.2020.103661] [PMID: 33326867]
[27]
Gagniuc PA. Markov chains: from theory to implementation and experimentation. John Wiley & Sons 2017.
[http://dx.doi.org/10.1002/9781119387596]
[28]
Page L, Brin S, Motwani R, Winograd T. The PageRank citation ranking: Bringing order to the web. Stanford InfoLab 1999.
[29]
Kleinberg JM. Authoritative sources in a hyperlinked environment. J Assoc Comput Mach 1999; 46(5): 604-32.
[http://dx.doi.org/10.1145/324133.324140]
[30]
Lempel R, Moran S. The stochastic approach for link-structure analysis (SALSA) and the TKC effect. Comput Netw 2000; 33(1-6): 387-401.
[http://dx.doi.org/10.1016/S1389-1286(00)00034-7]
[31]
Lempel R, Moran S. SALSA: the stochastic approach for link-structure analysis. ACM Trans Inf Syst 2001; 19(2): 131-60. TOIS
[http://dx.doi.org/10.1145/382979.383041]
[32]
Han H, Cho JW, Lee S, et al. TRRUST v2: an expanded reference database of human and mouse transcriptional regulatory interactions. Nucleic Acids Res 2018; 46(D1): D380-6.
[http://dx.doi.org/10.1093/nar/gkx1013] [PMID: 29087512]
[33]
Chung IF, Chen CY, Su SC, et al. DriverDBv2: a database for human cancer driver gene research. Nucleic Acids Res 2016; 44(D1): D975-9.
[http://dx.doi.org/10.1093/nar/gkv1314] [PMID: 26635391]
[34]
Li W, Bai X, Hu E, et al. Identification of breast cancer prognostic modules based on weighted protein-protein interaction networks. Oncol Lett 2017; 13(5): 3935-41.
[http://dx.doi.org/10.3892/ol.2017.5917] [PMID: 28529601]
[35]
Robin X, Turck N, Hainard A, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 2011; 12(1): 77.
[http://dx.doi.org/10.1186/1471-2105-12-77] [PMID: 21414208]

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy