Abstract
Background: Lung cancer is among the malignancies with the fastest-rising morbidity and mortality, yet its causes remain poorly understood. The goal of this investigation was to screen for and identify relevant biomarkers of lung cancer.
Methods: Publicly available lung cancer datasets, including GSE40275 and GSE134381, were obtained from the GEO database. Data repeatability was assessed by principal component analysis (PCA), and GEO2R was used to screen for differentially expressed genes (DEGs), all of which were subjected to enrichment analysis. Protein-protein interactions (PPIs), the significant module, and the hub genes were identified with Cytoscape. Expression and correlation analyses of the hub genes were carried out, and an overall survival analysis of lung cancer was performed. Receiver operating characteristic (ROC) curve analysis was used to test the sensitivity and specificity of the identified hub genes for diagnosing lung cancer.
Results: The two datasets showed good repeatability, and 115 DEGs and 10 hub genes were identified. Functional analysis revealed that these DEGs were associated with cell adhesion, the extracellular matrix, and calcium ion binding. The DEGs were mainly involved in ECM-receptor interaction, ABC transporters, cell-adhesion molecules, and the p53 signaling pathway. Ten genes (COL1A2, POSTN, DSG2, CDKN2A, COL1A1, KRT19, SLC2A1, SERPINB5, DSC3, and SPP1) were identified as hub genes through module analysis of the PPI network. Lung cancer patients with high expression of COL1A2, POSTN, DSG2, CDKN2A, COL1A1, SLC2A1, SERPINB5, and SPP1 had shorter overall survival than those with low expression (p < 0.05). The Comparative Toxicogenomics Database (CTD) showed that all 10 hub genes were closely related to lung cancer. Expression of POSTN, DSG2, CDKN2A, COL1A1, SLC2A1, SERPINB5, and SPP1 was also associated with a diagnosis of lung cancer (p < 0.05). ROC analysis showed that SPP1 (AUC = 0.940, p = 0.000*, 95% CI = 0.930-0.973, ODT = 7.004), SLC2A1 (AUC = 0.889, p = 0.000*, 95% CI = 0.791-0.865, ODT = 7.123), and CDKN2A (AUC = 0.730, p = 0.000*, 95% CI = 0.465-1.000, ODT = 6.071) were suitable biomarkers.
Conclusion: Microarray technology is an effective method for exploring the genetic targets and molecular mechanisms of lung cancer. In addition, the hub genes identified here provide novel research insights for its diagnosis and treatment.
Keywords: Lung cancer, hub gene, differentially expressed genes, bioinformatics, overall survival, microarray data.
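The ROC analysis summarized in the Results reports an AUC and a threshold for each hub gene. As an illustration only, the sketch below computes an AUC from expression values by pairwise comparison of tumor and normal samples, and picks a cutoff by maximizing Youden's J (sensitivity + specificity - 1). The expression values and labels are made-up toy data, the function names are hypothetical, and the abstract does not state which software or cutoff criterion produced the reported AUC and ODT values, so the use of Youden's J here is an assumption.

```python
from typing import List, Tuple


def roc_auc(scores: List[float], labels: List[int]) -> float:
    """AUC as the probability that a random positive sample scores higher
    than a random negative sample (ties count as 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def youden_cutoff(scores: List[float], labels: List[int]) -> Tuple[float, float, float]:
    """Return (cutoff, sensitivity, specificity) for the cutoff maximizing
    Youden's J. A sample is called positive when score >= cutoff.
    Assumption: this is one common criterion, not necessarily the paper's."""
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = sum(1 for y in labels if y == 0)
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative sample")
    best_j, best = -1.0, (0.0, 0.0, 0.0)
    for c in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= c)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < c)
        sens, spec = tp / n_pos, tn / n_neg
        j = sens + spec - 1.0
        if j > best_j:  # strict '>' keeps the lowest cutoff on ties
            best_j, best = j, (c, sens, spec)
    return best


# Toy data (hypothetical): log2 expression of one gene; 1 = tumor, 0 = normal.
expression = [5.1, 5.8, 6.2, 6.9, 7.0, 7.4, 7.9, 8.3]
labels = [0, 0, 0, 1, 0, 1, 1, 1]

auc = roc_auc(expression, labels)
cutoff, sens, spec = youden_cutoff(expression, labels)
print(f"AUC={auc:.4f} cutoff={cutoff} sensitivity={sens} specificity={spec}")
```

On this toy input, 15 of the 16 tumor-normal pairs are ranked correctly, so the AUC is 0.9375. A confidence interval for the AUC would need an additional step, such as bootstrapping over samples, which is not shown here.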