Abstract
Background: Lung cancer is cancer with the highest incidence in the world, and there is obvious heterogeneity within its tumor. The emergence of single-cell sequencing technology allows researchers to obtain cell-type-specific expression genes at the single-cell level, thereby obtaining information regarding the cell status and subpopulation distribution, as well as the communication behavior between cells. Many researchers have applied this technology to lung cancer research, but due to the shortcomings of insufficient sequencing depth, only a small part of the gene expression can be detected. Researchers can only roughly compare whether a few thousand genes are significant in different cell types.
Methods: To fully explore the expression of all genes in different cell types, we propose a method to predict cell-type-specific genes. This method infers cell-type-specific genes based on the expression levels of genes in different tissues and cells and gene interactions. At present, biological experiments have discovered a large number of cell-type-specific genes, providing a large number of available samples for the application of deep learning methods.
Results: Therefore, we fused Graph Convolutional Network (GCN) with Convolutional Neural Network( CNN) to build, model, and inferred cell-type-specific genes of lung cancer in 8 cell types.
Conclusion: This method further analyzes and processes single-cell data and provides a new basis for research on heterogeneity in lung cancer tumor, microenvironment, invasion and metastasis, treatment response, drug resistance, etc.
Keywords: Cell-type-specific genes, single-cell sequencing, lung cancer, deep learning, chemotherapy, radiotherapy.
Graphical Abstract
[http://dx.doi.org/10.1056/NEJMoa1704795] [PMID: 28586279]
[http://dx.doi.org/10.1038/bjc.2013.721] [PMID: 24263064]
[http://dx.doi.org/10.1038/s41576-020-0265-5] [PMID: 32807900]
[http://dx.doi.org/10.1038/nature09807] [PMID: 21399628]
[http://dx.doi.org/10.1093/nar/gkaa838] [PMID: 33010177]
[PMID: 30388456]
[http://dx.doi.org/10.1016/j.cell.2017.12.007] [PMID: 29307488]
[http://dx.doi.org/10.1016/j.cell.2020.03.048] [PMID: 32302573]
[http://dx.doi.org/10.1038/s41421-020-00200-x] [PMID: 33083004]
[http://dx.doi.org/10.1016/j.cell.2020.07.017] [PMID: 32822576]
[http://dx.doi.org/10.1038/s41591-018-0096-5] [PMID: 29988129]
[http://dx.doi.org/10.1038/s41467-020-16164-1] [PMID: 32385277]
[http://dx.doi.org/10.1186/s13059-015-0692-3] [PMID: 26084335]
[http://dx.doi.org/10.1186/s12859-018-2226-y] [PMID: 29884114]
[http://dx.doi.org/10.1038/s41467-020-14976-9] [PMID: 32127540]
[http://dx.doi.org/10.1186/s12859-019-2977-0] [PMID: 31299886]
[http://dx.doi.org/10.1038/nmeth.4292] [PMID: 28504683]
[http://dx.doi.org/10.1093/bioinformatics/btz435] [PMID: 31125056]
[PMID: 31392316]
[http://dx.doi.org/10.1038/nrg3833] [PMID: 25628217]