Abstract
With the development of sequencing technology, various forms of biomedical data, including genomics, transcriptomics, proteomics, microbiomics, and metabolomics data, are increasingly emerging. These data are an external manifestation of cell activity and mechanism. How to deeply analyze these data is critical to uncovering and understanding the nature of life. Due to the heterogeneousness and complexity of these data, it is a vastly challenging task for traditional machine learning to deal with it. Over the recent ten years, a new machine learning framework called graph neural networks (GNNs) has been proposed. The graph is a very powerful tool to represent a complex system. The GNNs is becoming a key to open the mysterious door of life. In this paper, we focused on summarizing state-ofthe- art GNNs algorithms (GraphSAGE, graph convolutional network, graph attention network, graph isomorphism network and graph auto-encoder), briefly introducing the main principles behind them. We also reviewed some applications of the GNNs to the area of biomedicine, and finally discussed the possible developing direction of GNNs in the future.
Keywords: Graph neural networks, graph convolutional network, graph attention network, graph auto-encoder, biomedical data, disease prediction.
[http://dx.doi.org/10.1016/j.cell.2020.03.022]
[http://dx.doi.org/10.1109/CVPR.2019.01175]
[http://dx.doi.org/10.1145/3178876.3186106]
[http://dx.doi.org/10.1145/3308558.3313562]
[http://dx.doi.org/10.1007/978-3-319-93417-4_38]
[http://dx.doi.org/10.1145/3219819.3219890]
[PMID: 32217482]
[http://dx.doi.org/10.1109/IJCNN.2010.5596796]
[http://dx.doi.org/10.1145/2939672.2939753]
[http://dx.doi.org/10.1007/978-3-030-04167-0_33]
[http://dx.doi.org/10.1360/N012019-00133]
[http://dx.doi.org/10.1016/j.neunet.2020.06.006]
[http://dx.doi.org/10.1145/2939672.2939754]
[http://dx.doi.org/10.1145/2623330.2623732]
[http://dx.doi.org/10.1109/IJCNN.2005.1555942]
[http://dx.doi.org/10.1145/3178876.3186116]
[http://dx.doi.org/10.1145/3219819.3220068]
[http://dx.doi.org/10.1145/3219819.3220000]
[http://dx.doi.org/10.1007/978-3-030-35817-4_5]
[http://dx.doi.org/10.1109/CVPR42600.2020.00386]
[http://dx.doi.org/10.1088/1742-6596/1642/1/012020]
[http://dx.doi.org/10.1007/978-3-030-20351-1_36]
[http://dx.doi.org/10.1093/bioinformatics/btaa428]
[http://dx.doi.org/10.1016/j.isci.2019.09.013]
[http://dx.doi.org/10.1007/s00438-020-01693-7]
[http://dx.doi.org/10.1038/nrg.2017.47]
[http://dx.doi.org/10.1093/bioinformatics/btaa211] [PMID: 32221609]
[http://dx.doi.org/10.1371/journal.pcbi.1007568] [PMID: 32433655]
[http://dx.doi.org/10.1007/978-3-319-66179-7_21]
[http://dx.doi.org/10.1016/j.media.2018.06.001] [PMID: 29890408]
[http://dx.doi.org/10.1109/ISBI.2019.8759274]
[http://dx.doi.org/10.1109/BIBM47256.2019.8983191]
[http://dx.doi.org/10.3390/cells8090977] [PMID: 31455028]
[PMID: 32749976]
[http://dx.doi.org/10.1016/j.ymeth.2020.05.010] [PMID: 32446956]
[http://dx.doi.org/10.1016/j.ymeth.2020.05.014] [PMID: 32622985]
[http://dx.doi.org/10.1039/D0RA02297G]
[http://dx.doi.org/10.1093/bioinformatics/btaa437]
[http://dx.doi.org/10.1093/bioinformatics/btaa822]
[http://dx.doi.org/10.1101/2020.04.07.030908]