Abstract
Breast cancer is the most predominantly occurring cancer in the world. Several genes and proteins have been recently studied to predict biomarkers that enable early disease identification and monitor its recurrence. In the era of high-throughput technology, studies show several applications of big data for identifying potential biomarkers. The review aims to provide a comprehensive overview of big data analysis in breast cancer towards the prediction of biomarkers with emphasis on computational methods like text mining, network analysis, next-generation sequencing technology (NGS), machine learning (ML), deep learning (DL), and precision medicine. Integrating data from various computational approaches enables the stratification of cancer patients and the identification of molecular signatures in cancer and their subtypes. The computational methods and statistical analysis help expedite cancer prognosis and develop precision cancer medicine (PCM). As a part of case study in the present work, we constructed a large gene-drug interaction network to predict new biomarkers genes. The gene-drug network helped us to identify eight genes that could serve as novel potential biomarkers.
Keywords: Breast cancer, biomarkers, big data, text mining, network analysis, driver genes.
Graphical Abstract
[http://dx.doi.org/10.1001/jama.2018.19323]
[http://dx.doi.org/10.1038/s41416-021-01328-7] [PMID: 33824479]
[http://dx.doi.org/10.1016/S2214-109X(20)30215-1] [PMID: 32710860]
[http://dx.doi.org/10.1002/cac2.12207] [PMID: 34399040]
[http://dx.doi.org/10.3322/caac.21660] [PMID: 33538338]
[http://dx.doi.org/10.1200/GO.20.00033] [PMID: 32511068]
[http://dx.doi.org/10.5152/ejbh.2017.3219] [PMID: 29322114]
[http://dx.doi.org/10.1002/path.5040] [PMID: 29344954]
[http://dx.doi.org/10.3390/cancers12030609] [PMID: 32155777]
[http://dx.doi.org/10.1186/s12885-020-6608-y] [PMID: 32050925]
[http://dx.doi.org/10.7150/jca.13141] [PMID: 27390604]
[http://dx.doi.org/10.1016/j.ajpath.2018.08.020] [PMID: 30385093]
[http://dx.doi.org/10.1038/s41598-019-55710-w] [PMID: 31836816]
[http://dx.doi.org/10.3892/ol.2018.8548] [PMID: 29844843]
[http://dx.doi.org/10.1001/jamanetworkopen.2020.13226] [PMID: 32804214]
[http://dx.doi.org/10.3389/fonc.2021.632357] [PMID: 34367947]
[http://dx.doi.org/10.5306/wjco.v5.i3.412] [PMID: 25114856]
[http://dx.doi.org/10.1038/s41598-021-04032-x] [PMID: 34997055]
[http://dx.doi.org/10.3389/fgene.2020.574661] [PMID: 33193681]
[http://dx.doi.org/10.1677/ERC-10-0136] [PMID: 20647302]
[http://dx.doi.org/10.1177/1178223421995854] [PMID: 33994789]
[http://dx.doi.org/10.1155/2020/1835691] [PMID: 32256579]
[http://dx.doi.org/10.1038/s41392-019-0069-2] [PMID: 31637013]
[http://dx.doi.org/10.5306/wjco.v8.i2.120] [PMID: 28439493]
[http://dx.doi.org/10.3389/fonc.2018.00227] [PMID: 29963498]
[http://dx.doi.org/10.1186/1756-8722-6-38] [PMID: 23731980]
[http://dx.doi.org/10.1016/j.cdtm.2018.04.002] [PMID: 30276363]
[http://dx.doi.org/10.1200/EDBK_320667] [PMID: 34061559]
[http://dx.doi.org/10.1158/1535-7163.MCT-20-0848] [PMID: 34158347]
[http://dx.doi.org/10.20517/cdr.2019.002] [PMID: 35582575]
[http://dx.doi.org/10.1007/s10549-020-05638-x] [PMID: 32323103]
[http://dx.doi.org/10.1016/j.mce.2021.111322] [PMID: 34000350]
[http://dx.doi.org/10.1016/j.annonc.2021.02.011] [PMID: 33617937]
[http://dx.doi.org/10.2174/187152008786847747] [PMID: 19075570]
[http://dx.doi.org/10.1042/BSR20171357] [PMID: 29298879]
[http://dx.doi.org/10.2174/1570163817666200518081955] [PMID: 32418525]
[http://dx.doi.org/10.3389/fonc.2021.731535] [PMID: 34778045]
[http://dx.doi.org/10.1038/s41523-020-0153-3] [PMID: 32195333]
[PMID: 22896759]
[http://dx.doi.org/10.1038/s41392-021-00868-x] [PMID: 35132063]
[http://dx.doi.org/10.1200/JCO.18.01160] [PMID: 30452337]
[http://dx.doi.org/10.1007/s10555-016-9649-6] [PMID: 27913999]
[http://dx.doi.org/10.1016/j.ddmod.2017.07.002]
[http://dx.doi.org/10.4155/bio-2018-0006] [PMID: 29923753]
[http://dx.doi.org/10.1208/s12248-017-0161-x] [PMID: 29181807]
[http://dx.doi.org/10.5402/2012/590626] [PMID: 22523699]
[http://dx.doi.org/10.1016/j.biopha.2019.109687] [PMID: 31918267]
[http://dx.doi.org/10.1186/s13148-018-0587-8] [PMID: 30744689]
[http://dx.doi.org/10.1016/j.biopha.2020.110986] [PMID: 33166764]
[http://dx.doi.org/10.1016/j.abst.2019.05.001]
[http://dx.doi.org/10.1177/1010428319881344] [PMID: 31608792]
[http://dx.doi.org/10.1186/s13148-018-0492-1] [PMID: 29713393]
[http://dx.doi.org/10.1634/theoncologist.2017-0535] [PMID: 29472313]
[http://dx.doi.org/10.1159/000509846] [PMID: 32982645]
[http://dx.doi.org/10.3389/fgene.2021.682503] [PMID: 34220957]
[http://dx.doi.org/10.1200/JCO.2020.38.6_suppl.162]
[http://dx.doi.org/10.3389/fonc.2022.917400] [PMID: 35880165]
[http://dx.doi.org/10.1155/2022/5621441] [PMID: 35242245]
[http://dx.doi.org/10.1148/radiol.2018171118] [PMID: 30040052]
[http://dx.doi.org/10.5306/wjco.v6.i6.252] [PMID: 26677438]
[http://dx.doi.org/10.3390/ijms21134579] [PMID: 32605126]
[http://dx.doi.org/10.1186/s12885-021-08318-1] [PMID: 34053447]
[http://dx.doi.org/10.1038/s41523-020-00197-2] [PMID: 33088912]
[http://dx.doi.org/10.21037/pcm-20-76]
[http://dx.doi.org/10.1016/j.ctrv.2020.102064] [PMID: 32622272]
[http://dx.doi.org/10.1186/s13046-021-02098-z] [PMID: 34620206]
[http://dx.doi.org/10.1200/OP.21.00172] [PMID: 34077236]
[http://dx.doi.org/10.1186/s12957-020-02026-z] [PMID: 32977823]
[http://dx.doi.org/10.1038/s41419-019-2043-x] [PMID: 31649243]
[http://dx.doi.org/10.1007/s10549-021-06294-5] [PMID: 34185195]
[http://dx.doi.org/10.1038/nrclinonc.2012.121] [PMID: 22825374]
[http://dx.doi.org/10.1007/s13193-014-0290-y] [PMID: 24669166]
[http://dx.doi.org/10.18632/oncotarget.13990] [PMID: 27999206]
[http://dx.doi.org/10.1371/journal.pone.0188068] [PMID: 29140993]
[http://dx.doi.org/10.1186/1471-2407-11-417] [PMID: 21955753]
[http://dx.doi.org/10.3389/fmolb.2022.783450] [PMID: 35265667]
[http://dx.doi.org/10.3389/fphar.2020.632079] [PMID: 33716731]
[http://dx.doi.org/10.1371/journal.pone.0226765] [PMID: 31881042]
[http://dx.doi.org/10.1038/s41437-020-0303-2] [PMID: 32139886]
[http://dx.doi.org/10.1016/j.semradonc.2019.05.002] [PMID: 31472730]
[http://dx.doi.org/10.1016/j.gpb.2015.01.005] [PMID: 25707591]
[http://dx.doi.org/10.1093/bib/bbz121] [PMID: 31774481]
[http://dx.doi.org/10.1371/journal.pcbi.1005752] [PMID: 29099853]
[http://dx.doi.org/10.1371/journal.pone.0055489] [PMID: 23408991]
[http://dx.doi.org/10.1038/s41586-020-1969-6] [PMID: 32025007]
[http://dx.doi.org/10.1093/nar/gkp995] [PMID: 19906727]
[http://dx.doi.org/10.1021/acs.jproteome.5b01091] [PMID: 26860878]
[http://dx.doi.org/10.1093/nar/gkt1025] [PMID: 24214964]
[http://dx.doi.org/10.1038/nature11003] [PMID: 22460905]
[http://dx.doi.org/10.1158/2159-8290.CD-12-0095] [PMID: 22588877]
[http://dx.doi.org/10.1038/sdata.2017.124] [PMID: 28925987]
[http://dx.doi.org/10.1186/gb-2004-5-10-r80] [PMID: 15461798]
[http://dx.doi.org/10.1093/nar/gkaa434] [PMID: 32479607]
[http://dx.doi.org/10.1016/j.imu.2018.05.003]
[http://dx.doi.org/10.1016/j.gendis.2017.07.003] [PMID: 30258916]
[http://dx.doi.org/10.1080/21553769.2016.1178180]
[http://dx.doi.org/10.1371/journal.pone.0030619] [PMID: 22312429]
[http://dx.doi.org/10.1016/j.preteyeres.2016.06.001] [PMID: 27297499]
[http://dx.doi.org/10.1038/s41598-020-67393-9] [PMID: 32591580]
[http://dx.doi.org/10.3390/cancers12102928] [PMID: 33053644]
[http://dx.doi.org/10.1371/journal.pone.0191195] [PMID: 29324832]
[http://dx.doi.org/10.2147/PGPM.S167886] [PMID: 31213877]
[http://dx.doi.org/10.1016/j.copbio.2019.03.004] [PMID: 30965188]
[http://dx.doi.org/10.1155/2013/865980] [PMID: 23690882]
[http://dx.doi.org/10.3389/fgene.2019.00049] [PMID: 30809243]
[http://dx.doi.org/10.1016/j.flm.2017.06.001]
[http://dx.doi.org/10.2174/1386207318666150703112447] [PMID: 26138573]
[http://dx.doi.org/10.1016/j.csbj.2014.11.005] [PMID: 25750696]
[http://dx.doi.org/10.1155/2021/9025470] [PMID: 34754327]
[http://dx.doi.org/10.1016/j.ejca.2021.10.007] [PMID: 34810047]
[http://dx.doi.org/10.1186/s13073-021-00968-x] [PMID: 34579788]
[http://dx.doi.org/10.1016/j.compbiolchem.2016.09.011] [PMID: 27744173]
[http://dx.doi.org/10.1101/gr.1239303] [PMID: 14597658]
[http://dx.doi.org/10.1016/j.omto.2021.12.025] [PMID: 35118192]
[http://dx.doi.org/10.1186/s12885-021-08641-7] [PMID: 34407787]
[http://dx.doi.org/10.1097/MD.0000000000028425] [PMID: 34941192]
[http://dx.doi.org/10.1111/jcmm.17041] [PMID: 35170195]
[http://dx.doi.org/10.3389/fonc.2021.783211] [PMID: 34869037]
[http://dx.doi.org/10.1016/j.canlet.2021.07.004] [PMID: 34274167]
[http://dx.doi.org/10.1111/tbj.13067] [PMID: 29785740]
[http://dx.doi.org/10.3389/fimmu.2022.904682] [PMID: 35844507]
[http://dx.doi.org/10.1016/j.cell.2017.06.031] [PMID: 28709002]
[PMID: 32744648]