Abstract
Background: LncRNA is not only involved in the regulation of the biological functions of protein-coding genes, but its dysfunction is also associated with the occurrence and progression of various diseases. Various studies have shown that an in-depth understanding of the mechanism of action of lncRNA is of great significance for disease treatment. However, traditional wet testing is time-consuming, laborious, expensive, and has many subjective factors which may affect the accuracy of the experiment.
Objective: Most of the methods for predicting lncRNA-protein interaction (LPI) rely on a single feature, or there is noise in the feature. To solve this problem, we proposed a computational model, CSALPI based on a deep neural network.
Methods: Firstly, this model utilizes cosine similarity to extract similarity features for lncRNAlncRNA and protein-protein, denoising similar features using the Sparse Autoencoder. Second, a neighbor enhancement autoencoder is employed to enforce neighboring nodes to be represented similarly by reconstructing the denoised features. Finally, a Light Gradient Boosting Machine classifier is used to predict potential LPIs.
Results: To demonstrate the reliability of CSALPI, multiple evaluation metrics were used under a 5- fold cross-validation experiment, and excellent results were achieved. In the case study, the model successfully predicted 7 out of 10 disease-associated lncRNA and protein pairs.
Conclusion: The CSALPI can be an effective complementary method for predicting potential LPIs from biological experiments.
[http://dx.doi.org/10.1126/science.aaj2239] [PMID: 28473536]
[http://dx.doi.org/10.1016/j.neunet.2022.09.026] [PMID: 36274524]
[http://dx.doi.org/10.1093/bib/bbab089] [PMID: 33834199]
[http://dx.doi.org/10.1007/s11427-014-4703-5] [PMID: 25104457]
[http://dx.doi.org/10.1093/bib/bbac083] [PMID: 35323894]
[http://dx.doi.org/10.1158/0008-5472.CAN-12-2850] [PMID: 23243023]
[http://dx.doi.org/10.1186/s12943-015-0458-2] [PMID: 26536864]
[PMID: 25859406]
[http://dx.doi.org/10.1093/bfgp/elu047] [PMID: 25504152]
[http://dx.doi.org/10.1101/gr.138545.112] [PMID: 23064747]
[http://dx.doi.org/10.1016/j.csbj.2019.11.004] [PMID: 31890140]
[http://dx.doi.org/10.18632/oncotarget.21934] [PMID: 29262614]
[http://dx.doi.org/10.3389/fgene.2018.00239] [PMID: 30023002]
[http://dx.doi.org/10.1093/bioinformatics/btz768] [PMID: 31598637]
[http://dx.doi.org/10.1016/j.omtn.2018.09.020] [PMID: 30388620]
[http://dx.doi.org/10.3389/fgene.2019.00343] [PMID: 31057602]
[http://dx.doi.org/10.3389/fgene.2020.615144] [PMID: 33362868]
[http://dx.doi.org/10.1186/s12859-020-03914-7] [PMID: 33461501]
[http://dx.doi.org/10.1093/nar/gkt1057] [PMID: 24217916]
[http://dx.doi.org/10.1093/nar/gkv1252] [PMID: 26586799]
[http://dx.doi.org/10.1093/nar/gkh131] [PMID: 14681372]
[http://dx.doi.org/10.3389/fgene.2021.814073] [PMID: 35186016]
[http://dx.doi.org/10.1109/TKDE.2023.3271677]
[http://dx.doi.org/10.1016/j.isci.2023.107478] [PMID: 37583550]
[http://dx.doi.org/10.1093/bib/bbac364] [PMID: 36130259]
[http://dx.doi.org/10.1186/s12967-023-03876-3] [PMID: 36698208]
[http://dx.doi.org/10.1016/j.csbj.2022.04.029] [PMID: 35521556]
[http://dx.doi.org/10.1093/bioinformatics/btad410] [PMID: 37369035]
[http://dx.doi.org/10.1145/2623330.2623732]
[http://dx.doi.org/10.1145/2736277.2741093]
[http://dx.doi.org/10.1145/2939672.2939753]
[http://dx.doi.org/10.1023/A:1010933404324]
[http://dx.doi.org/10.1016/j.ajhg.2008.02.013] [PMID: 18371930]
[http://dx.doi.org/10.1016/j.gpb.2016.01.004] [PMID: 26917505]
[http://dx.doi.org/10.1371/journal.pcbi.1006616] [PMID: 30533006]
[http://dx.doi.org/10.1016/j.neucom.2017.07.065]
[http://dx.doi.org/10.1002/pros.23120] [PMID: 26764246]
[http://dx.doi.org/10.1038/ng.132] [PMID: 18372902]