Abstract
Background: Renal cell carcinoma (RCC) is the most common malignant tumor of the adult kidney.
Objective: The aim of this study was to identify key genes signatures during RCC and uncover their potential mechanisms.
Methods: Firstly, the gene expression profiles of GSE53757 which contained 144 samples, including 72 kidney cancer samples and 72 controls, were downloaded from the GEO database. And then differentially expressed genes (DEGs) between the kidney cancer samples and the controls were identified. After that, GO and KEGG enrichment analyses of DEGs were performed by DAVID. Furthermore, the correlation-based feature subset (CFS) method was applied to the selection of key genes of DEGs. In addition, the classification model between the kidney cancer samples and the controls was built by Adaboost based on the selected key genes.
Results: 213 DEGs including 80 up-regulated and 133 down-regulated genes were selected as the feature genes to build the classification model between the kidney cancer samples and the controls by CFS method. The accuracy of the classification model by using 5-folds cross-validation test and independent set test is 84.4% and 83.3%, respectively. Besides, TYROBP, CD4163, CAV1, CXCL9, CXCL11 and CXCL13 also can be found in the top 20 hub genes screened by proteinprotein interaction (PPI) network.
Conclusion: It indicated that CFS is a useful tool to identify key genes in kidney cancer. Besides, we also predicted genes such as TYROBP, CD4163, CAV1, CXCL9, CXCL11 and CXCL13 that might target genes to diagnose the kidney cancer.
Keywords: Gene expression profiles, gene selection, renal cell carcinoma, correlation-based feature subset (CFS), Adaboost, gene ontology.
Graphical Abstract
[http://dx.doi.org/10.1155/2017/1972097] [PMID: 28691016]
[http://dx.doi.org/10.1007/s11030-017-9732-0] [PMID: 28275924]
[http://dx.doi.org/10.1155/2017/4649191] [PMID: 28630865]
[http://dx.doi.org/10.1053/j.gastro.2014.12.028] [PMID: 25557953]
[http://dx.doi.org/10.1016/j.jneumeth.2008.02.022] [PMID: 18455804]
[http://dx.doi.org/10.1186/1471-2105-11-583] [PMID: 21118506]
[http://dx.doi.org/10.1093/biostatistics/4.2.249] [PMID: 12925520]
[http://dx.doi.org/10.1093/bioinformatics/btm412] [PMID: 17720982]
[PMID: 14681407]
[PMID: 23161678]
[PMID: 18077471]
[http://dx.doi.org/10.1038/nprot.2008.211] [PMID: 19131956]
[http://dx.doi.org/10.1093/nar/gkn923] [PMID: 19033363]
[http://dx.doi.org/10.1093/nar/gkm415] [PMID: 17576678]
[http://dx.doi.org/10.1093/nar/gkq973] [PMID: 21045058]
[http://dx.doi.org/10.1093/nar/gku1003] [PMID: 25352553]
[http://dx.doi.org/10.2174/1567205016666190103154855]
[PMID: 22591479]
[http://dx.doi.org/10.2174/1389450119666180809122244] [PMID: 30091413]
[http://dx.doi.org/10.1016/j.csbj.2018.11.007] [PMID: 30595814]
[http://dx.doi.org/10.1109/TCYB.2016.2623900] [PMID: 27898387]
[http://dx.doi.org/10.2174/092986606776819619] [PMID: 16800803]
[http://dx.doi.org/10.1016/j.chemolab.2008.11.003]
[http://dx.doi.org/10.1007/s11030-009-9116-1] [PMID: 19219560]
[http://dx.doi.org/10.1016/j.chemolab.2011.06.007]
[http://dx.doi.org/10.2174/092986612803521585] [PMID: 22670666]
[http://dx.doi.org/10.2174/1573406413666170522150940] [PMID: 28530547]
[http://dx.doi.org/10.2174/1574893611308050007]
[http://dx.doi.org/10.1016/j.bbapap.2013.07.008] [PMID: 23907006]
[http://dx.doi.org/10.18632/oncotarget.17210] [PMID: 28467816]
[http://dx.doi.org/10.18632/oncotarget.19169] [PMID: 28915661]
[http://dx.doi.org/10.2174/092986608784966921] [PMID: 18680454]
[http://dx.doi.org/10.1002/jcc.21024] [PMID: 18496789]
[http://dx.doi.org/10.1023/A:1010933404324]
[http://dx.doi.org/10.1007/BF00058655]
[http://dx.doi.org/10.1073/pnas.1800256115] [PMID: 29440440]
[http://dx.doi.org/10.2174/1386207319666151110122931] [PMID: 26552439]
[http://dx.doi.org/10.4049/jimmunol.178.4.2278] [PMID: 17277133]
[http://dx.doi.org/10.1084/jem.20052144] [PMID: 16940167]
[PMID: 9670971]
[http://dx.doi.org/10.1016/j.clbc.2013.05.003] [PMID: 23810293]
[http://dx.doi.org/10.1016/S0002-9440(10)63920-X] [PMID: 12651616]