Abstract
Background: Human microbial communities play an important role in some physiological process of human beings. Nevertheless, the identification of microbe-disease associations through biological experiments is costly and time-consuming. Hence, the development of calculation models is meaningful to infer latent associations between microbes and diseases.
Aims: In this manuscript, we aim to design a computational model based on the Graph Convolutional Neural Network with Multi-layer Attention mechanism, called GCNMA, to infer latent microbe-disease associations.
Objective: This study aims to propose a novel computational model based on the Graph Convolutional Neural Network with Multi-layer Attention mechanism, called GCNMA, to detect potential microbedisease associations.
Methods: In GCNMA, the known microbe-disease association network was first integrated with the microbe- microbe similarity network and the disease-disease similarity network into a heterogeneous network first. Subsequently, the graph convolutional neural network was implemented to extract embedding features of each layer for microbes and diseases respectively. Thereafter, these embedding features of each layer were fused together by adopting the multi-layer attention mechanism derived from the graph convolutional neural network, based on which, a bilinear decoder would be further utilized to infer possible associations between microbes and diseases.
Results: Finally, to evaluate the predictive ability of GCNMA, intensive experiments were done and compared results with eight state-of-the-art methods which demonstrated that under the frameworks of both 2-fold cross-validations and 5-fold cross-validations, GCNMA can achieve satisfactory prediction performance based on different databases including HMDAD and Disbiome simultaneously. Moreover, case studies on three kinds of common diseases such as asthma, type 2 diabetes, and inflammatory bowel disease verified the effectiveness of GCNMA as well.
Conclusion: GCNMA outperformed 8 state-of-the-art competitive methods based on the benchmarks of both HMDAD and Disbiome.
[http://dx.doi.org/10.1126/science.1124234] [PMID: 16741115]
[http://dx.doi.org/10.1016/j.chom.2014.08.014] [PMID: 25211071]
[http://dx.doi.org/10.1371/journal.pbio.1002533]
[http://dx.doi.org/10.1038/nature10213] [PMID: 21677749]
[http://dx.doi.org/10.1016/S0140-6736(03)12489-0] [PMID: 12583961]
[http://dx.doi.org/10.1007/s12275-018-8032-4] [PMID: 29492874]
[http://dx.doi.org/10.1073/pnas.0812600106] [PMID: 19164560]
[http://dx.doi.org/10.1038/nrgastro.2012.152] [PMID: 22907164]
[http://dx.doi.org/10.1046/j.1440-1746.2002.02834.x] [PMID: 12164960]
[http://dx.doi.org/10.1080/01635581.2017.1263750] [PMID: 28094541]
[http://dx.doi.org/10.1016/j.cell.2016.11.018] [PMID: 27912057]
[http://dx.doi.org/10.1371/journal.pone.0227147] [PMID: 31995569]
[http://dx.doi.org/10.1038/nrn3346] [PMID: 22968153]
[http://dx.doi.org/10.1016/j.neuroscience.2010.08.005] [PMID: 20696216]
[http://dx.doi.org/10.1093/bib/bbw005] [PMID: 26883326]
[http://dx.doi.org/10.1186/s12866-018-1197-5] [PMID: 29866037]
[PMID: 33418085]
[http://dx.doi.org/10.1186/s12859-021-04346-7]
[http://dx.doi.org/10.1093/nar/gkaa902] [PMID: 33080028]
[http://dx.doi.org/10.1093/bioinformatics/btw715]
[http://dx.doi.org/10.1038/s41598-017-08127-2] [PMID: 28790448]
[http://dx.doi.org/10.1093/bib/bbaa146] [PMID: 32725163]
[PMID: 31397851]
[http://dx.doi.org/10.1093/bioinformatics/btz965] [PMID: 31904845]
[http://dx.doi.org/10.1093/bib/bbaa243] [PMID: 33078832]
[http://dx.doi.org/10.1371/journal.pcbi.1005366] [PMID: 28152007]
[http://dx.doi.org/10.1093/bioinformatics/btl467] [PMID: 16954137]
[http://dx.doi.org/10.1093/nar/gky1126] [PMID: 30418591]
[http://dx.doi.org/10.1093/bioinformatics/btaa598] [PMID: 32597948]
[http://dx.doi.org/10.1093/bioinformatics/bty294] [PMID: 29949996]
[http://dx.doi.org/10.18653/v1/P19-1617]
[http://dx.doi.org/10.1109/TCBB.2021.3116318] [PMID: 34587092]
[http://dx.doi.org/10.1093/bioinformatics/btaa891] [PMID: 33381844]
[http://dx.doi.org/10.1145/3397271.3401063]
[http://dx.doi.org/10.1109/BIBM47256.2019.8983350]
[http://dx.doi.org/10.1109/WACV.2017.58]
[http://dx.doi.org/10.48550/arXiv.1801.07606]
[http://dx.doi.org/10.1007/978-3-319-95957-3_78]
[http://dx.doi.org/10.3389/fmicb.2019.00684] [PMID: 31024481]
[http://dx.doi.org/10.3389/fbioe.2020.00831] [PMID: 32850711]
[http://dx.doi.org/10.1109/TCBB.2020.2986459]
[http://dx.doi.org/10.4103/atm.ATM_697_20] [PMID: 33680125]
[http://dx.doi.org/10.1056/NEJMoa1211592] [PMID: 23534543]
[http://dx.doi.org/10.1186/s12931-016-0479-4] [PMID: 27919249]
[http://dx.doi.org/10.1016/S0140-6736(07)60750-8] [PMID: 17499605]
[http://dx.doi.org/10.1053/gast.2001.22122] [PMID: 11179240]
[http://dx.doi.org/10.3748/wjg.v20.i1.91] [PMID: 24415861]
[http://dx.doi.org/10.1038/s41598-017-10034-5] [PMID: 28842640]
[http://dx.doi.org/10.1056/NEJMoa012512] [PMID: 11832527]
[http://dx.doi.org/10.1056/NEJM200105033441801] [PMID: 11333990]