Abstract
Background: Predicting drug-related associations is an important task in drug development and discovery. With the rapid advancement of high-throughput technologies and various biological and medical data, artificial intelligence (AI), especially progress in machine learning (ML) and deep learning (DL), has paved a new way for the development of drug-related associations prediction. Many studies have been conducted in the literature to predict drug-related associations. This study looks at various computational methods used for drug-related associations prediction with the hope of getting a better insight into the computational methods used.
Methods: The various computational methods involved in drug-related associations prediction have been reviewed in this work. We have first summarized the drug, target, and disease-related mainstream public datasets. Then, we have discussed existing drug similarity, target similarity, and integrated similarity measurement approaches and grouped them according to their suitability. We have then comprehensively investigated drug-related associations and introduced relevant computational methods. Finally, we have briefly discussed the challenges involved in predicting drug-related associations.
Results: We discovered that quite a few studies have used implemented ML and DL approaches for drug-related associations prediction. The key challenges were well noted in constructing datasets with reasonable negative samples, extracting rich features, and developing powerful prediction models or ensemble strategies.
Conclusion: This review presents useful knowledge and future challenges on the subject matter with the hope of promoting further studies on predicting drug-related associations.
Graphical Abstract
[http://dx.doi.org/10.7150/jca.63517] [PMID: 34729122]
[http://dx.doi.org/10.1109/BIBM52615.2021.9669497]
[http://dx.doi.org/10.2147/IDR.S258037] [PMID: 32765017]
[http://dx.doi.org/10.1093/bib/bbx017] [PMID: 28334136]
[http://dx.doi.org/10.3390/molecules25225277] [PMID: 33198233]
[http://dx.doi.org/10.1007/978-1-0716-0826-5_7] [PMID: 32804365]
[http://dx.doi.org/10.1002/med.21764] [PMID: 33295676]
[http://dx.doi.org/10.1007/s11814-023-1377-3] [PMID: 36748027]
[http://dx.doi.org/10.1093/bib/bbaa256] [PMID: 33126246]
[http://dx.doi.org/10.1093/bioinformatics/btab207] [PMID: 33769494]
[http://dx.doi.org/10.1038/s41397-021-00246-4] [PMID: 34155353]
[http://dx.doi.org/10.1021/acs.chemrestox.9b00238] [PMID: 31777246]
[http://dx.doi.org/10.1109/JBHI.2020.3048059] [PMID: 33373310]
[http://dx.doi.org/10.1093/bib/bbaa040] [PMID: 32349125]
[http://dx.doi.org/10.1093/bib/bbab133] [PMID: 33951725]
[http://dx.doi.org/10.1093/bib/bbac209] [PMID: 35667078]
[http://dx.doi.org/10.1093/bib/bbab441] [PMID: 34695842]
[http://dx.doi.org/10.1371/journal.pcbi.1010812] [PMID: 36701288]
[http://dx.doi.org/10.3389/fgene.2021.702259] [PMID: 34504515]
[http://dx.doi.org/10.1109/JBHI.2021.3121798] [PMID: 34673498]
[http://dx.doi.org/10.1093/bib/bbab453] [PMID: 34718408]
[http://dx.doi.org/10.1093/bioinformatics/btac485] [PMID: 35801934]
[http://dx.doi.org/10.1093/bioinformatics/btaa451] [PMID: 32657406]
[http://dx.doi.org/10.1093/bioinformatics/btac377] [PMID: 35652721]
[http://dx.doi.org/10.1093/bioinformatics/btz682] [PMID: 31501885]
[http://dx.doi.org/10.1145/3458754]
[http://dx.doi.org/10.1093/bib/bbac409] [PMID: 36156661]
[http://dx.doi.org/10.1093/nar/gkx1037] [PMID: 29126136]
[http://dx.doi.org/10.1021/ci3001277] [PMID: 22587354]
[http://dx.doi.org/10.1093/nar/gkr777] [PMID: 21948594]
[http://dx.doi.org/10.1093/nar/gkv951] [PMID: 26400175]
[http://dx.doi.org/10.1093/nar/gkab953] [PMID: 34718717]
[http://dx.doi.org/10.1093/nar/gkr912] [PMID: 22067455]
[http://dx.doi.org/10.1093/nar/gkv1070] [PMID: 26476454]
[http://dx.doi.org/10.1093/nar/30.1.163] [PMID: 11752281]
[http://dx.doi.org/10.1093/nar/gkaa891] [PMID: 33068428]
[http://dx.doi.org/10.1021/acs.jcim.7b00175] [PMID: 28906116]
[http://dx.doi.org/10.1093/nar/gkt1207] [PMID: 24293645]
[http://dx.doi.org/10.1093/nar/gkab880] [PMID: 34634800]
[http://dx.doi.org/10.1093/nar/gkl999] [PMID: 17145705]
[http://dx.doi.org/10.1038/msb.2009.98] [PMID: 20087340]
[http://dx.doi.org/10.1126/science.1132939] [PMID: 17008526]
[http://dx.doi.org/10.3389/fcimb.2018.00424] [PMID: 30581775]
[http://dx.doi.org/10.1093/nar/gkx1157] [PMID: 29156005]
[http://dx.doi.org/10.1016/j.ijid.2020.02.018]
[http://dx.doi.org/10.1101/gr.1680803] [PMID: 14525934]
[http://dx.doi.org/10.1093/bioinformatics/btac793] [PMID: 36484697]
[http://dx.doi.org/10.1093/nar/gkt1180] [PMID: 24288376]
[http://dx.doi.org/10.1093/nar/gku1003] [PMID: 25352553]
[http://dx.doi.org/10.1038/75556] [PMID: 10802651]
[PMID: 31680165]
[http://dx.doi.org/10.1186/s12864-017-3911-3] [PMID: 28812536]
[http://dx.doi.org/10.1016/S0140-6736(19)31205-X] [PMID: 31180012]
[http://dx.doi.org/10.1093/nar/gku1205] [PMID: 25428349]
[http://dx.doi.org/10.1021/ci00057a005]
[http://dx.doi.org/10.1021/ci025584y] [PMID: 12653513]
[http://dx.doi.org/10.1093/nar/gkq367] [PMID: 20460463]
[http://dx.doi.org/10.1089/cmb.2010.0213]
[PMID: 15137231]
[http://dx.doi.org/10.1039/C9SC04336E] [PMID: 34123272]
[http://dx.doi.org/10.2174/1574886313666181026100000] [PMID: 30362421]
[http://dx.doi.org/10.1093/bioinformatics/bts413] [PMID: 22962489]
[http://dx.doi.org/10.1016/j.compbiolchem.2017.03.011] [PMID: 28648470]
[http://dx.doi.org/10.1371/journal.pone.0021132] [PMID: 21731656]
[http://dx.doi.org/10.1109/TBME.2016.2573285] [PMID: 27740470]
[http://dx.doi.org/10.1016/j.jbi.2021.103711] [PMID: 33610881]
[http://dx.doi.org/10.1016/j.ijmedinf.2019.02.003] [PMID: 30784433]
[http://dx.doi.org/10.1186/s12859-016-1415-9] [PMID: 28056782]
[http://dx.doi.org/10.1038/s41598-019-50121-3] [PMID: 31541145]
[http://dx.doi.org/10.1186/s12859-016-1336-7] [PMID: 28155639]
[http://dx.doi.org/10.1016/0022-2836(81)90087-5] [PMID: 7265238]
[http://dx.doi.org/10.1093/bioinformatics/btq064] [PMID: 20179076]
[http://dx.doi.org/10.1093/bioinformatics/btp433] [PMID: 19605421]
[http://dx.doi.org/10.1186/s12859-019-2811-8] [PMID: 31138103]
[http://dx.doi.org/10.1038/nmeth.2689] [PMID: 24122041]
[http://dx.doi.org/10.1613/jair.514]
[http://dx.doi.org/10.1007/s12539-021-00424-9] [PMID: 33761117]
[http://dx.doi.org/10.1093/bioinformatics/btm087] [PMID: 17344234]
[http://dx.doi.org/10.1093/bioinformatics/btm212] [PMID: 17646309]
[http://dx.doi.org/10.1186/s12859-016-0890-3] [PMID: 26801218]
[http://dx.doi.org/10.1093/nar/gkn582] [PMID: 18776214]
[http://dx.doi.org/10.1093/bioinformatics/btn409] [PMID: 18676415]
[http://dx.doi.org/10.1371/journal.pone.0080129] [PMID: 24278248]
[http://dx.doi.org/10.1155/2017/2713280]
[http://dx.doi.org/10.1021/acsomega.0c05377] [PMID: 33553921]
[http://dx.doi.org/10.1093/bioinformatics/btx731] [PMID: 29186331]
[http://dx.doi.org/10.1109/TCBB.2020.2977335] [PMID: 32142454]
[http://dx.doi.org/10.1109/TCBB.2020.2988018] [PMID: 32310779]
[http://dx.doi.org/10.1039/C5MB00615E] [PMID: 26675534]
[http://dx.doi.org/10.1021/ci400010x] [PMID: 23527559]
[http://dx.doi.org/10.1016/j.compbiolchem.2018.11.028] [PMID: 30528728]
[http://dx.doi.org/10.1016/j.aca.2016.01.014] [PMID: 26851083]
[http://dx.doi.org/10.1038/nmeth.2810] [PMID: 24464287]
[http://dx.doi.org/10.1007/s13721-019-0215-3]
[http://dx.doi.org/10.1007/978-3-030-60802-6_32]
[http://dx.doi.org/10.1039/D0RA02297G] [PMID: 35517730]
[http://dx.doi.org/10.1109/BIBM49941.2020.9313489]
[http://dx.doi.org/10.1093/bib/bbab319] [PMID: 34378011]
[http://dx.doi.org/10.1186/s12859-020-3379-z] [PMID: 32033537]
[http://dx.doi.org/10.1016/j.jbi.2019.103159] [PMID: 30926470]
[http://dx.doi.org/10.1093/bioinformatics/btn162] [PMID: 18586719]
[http://dx.doi.org/10.1038/s41586-021-03819-2] [PMID: 34265844]
[http://dx.doi.org/10.1126/science.abj8754] [PMID: 34282049]
[http://dx.doi.org/10.3390/ijms24087119] [PMID: 37108279]
[http://dx.doi.org/10.15252/msb.202211081] [PMID: 36065847]
[http://dx.doi.org/10.1002/prot.26382] [PMID: 35510704]
[http://dx.doi.org/10.1038/s41598-021-82410-1] [PMID: 33542326]
[http://dx.doi.org/10.1021/ct4004228] [PMID: 24124403]
[http://dx.doi.org/10.1080/17460441.2022.2114451] [PMID: 35983695]
[http://dx.doi.org/10.1371/journal.pone.0066952] [PMID: 23840562]
[http://dx.doi.org/10.1093/bioinformatics/btr500] [PMID: 21893517]
[http://dx.doi.org/10.1002/minf.201400009] [PMID: 27485302]
[http://dx.doi.org/10.1093/bioinformatics/btx160] [PMID: 28430977]
[http://dx.doi.org/10.1145/2623330.2623732]
[http://dx.doi.org/10.1021/acs.jproteome.6b00618] [PMID: 28264154]
[http://dx.doi.org/10.1093/bioinformatics/bty593] [PMID: 30423097]
[http://dx.doi.org/10.1186/s12911-020-1052-0] [PMID: 32183788]
[http://dx.doi.org/10.1093/bioinformatics/btaa921] [PMID: 33119053]
[http://dx.doi.org/10.1093/bioinformatics/bty543] [PMID: 30561548]
[http://dx.doi.org/10.1093/bib/bbab346] [PMID: 34661237]
[http://dx.doi.org/10.1016/j.compbiolchem.2021.107476] [PMID: 33799080]
[http://dx.doi.org/10.1093/bioinformatics/btaa880] [PMID: 33070179]
[http://dx.doi.org/10.1016/j.compbiomed.2022.105214] [PMID: 35030496]
[http://dx.doi.org/10.1093/bioinformatics/btac648] [PMID: 36205562]
[http://dx.doi.org/10.1155/2018/1425608] [PMID: 30627536]
[http://dx.doi.org/10.1371/journal.pcbi.1004760] [PMID: 26872142]
[http://dx.doi.org/10.1007/s11030-022-10492-8] [PMID: 35871213]
[http://dx.doi.org/10.1371/journal.pcbi.1002503] [PMID: 22589709]
[http://dx.doi.org/10.1039/c2mb00002d] [PMID: 22538619]
[http://dx.doi.org/10.1371/journal.pcbi.1007068] [PMID: 31125330]
[http://dx.doi.org/10.1007/s12539-023-00550-6] [PMID: 36646843]
[http://dx.doi.org/10.1038/msb.2012.26] [PMID: 22806140]
[http://dx.doi.org/10.1016/j.jbi.2017.04.021] [PMID: 28465082]
[http://dx.doi.org/10.1073/pnas.1803294115] [PMID: 29666228]
[http://dx.doi.org/10.24963/ijcai.2019/628]
[http://dx.doi.org/10.1016/j.ymeth.2020.05.007] [PMID: 32497603]
[http://dx.doi.org/10.1016/j.ymeth.2019.02.021] [PMID: 30822516]
[http://dx.doi.org/10.1186/s12859-020-03724-x] [PMID: 32972364]
[http://dx.doi.org/10.1186/s12859-023-05212-4] [PMID: 36918766]
[http://dx.doi.org/10.1093/bioinformatics/bty294] [PMID: 29949996]
[http://dx.doi.org/10.1145/3307339.3342161]
[http://dx.doi.org/10.1186/s12859-022-04876-8] [PMID: 35965308]
[http://dx.doi.org/10.1016/j.artmed.2021.102153] [PMID: 34531012]
[http://dx.doi.org/10.1093/bib/bbab421] [PMID: 34671814]
[http://dx.doi.org/10.1038/srep12339] [PMID: 26196247]
[http://dx.doi.org/10.1093/bioinformatics/btw342] [PMID: 27354693]
[http://dx.doi.org/10.1186/s12859-018-2379-8] [PMID: 30453924]
[http://dx.doi.org/10.1016/j.jbi.2018.11.005] [PMID: 30445219]
[http://dx.doi.org/10.1016/j.ins.2019.05.017]
[http://dx.doi.org/10.1109/JBHI.2023.3246225] [PMID: 37027562]
[http://dx.doi.org/10.1186/1471-2105-11-S5-P9]
[http://dx.doi.org/10.1186/1471-2105-12-S2-S1]
[http://dx.doi.org/10.1093/bioinformatics/btw486] [PMID: 27466626]
[http://dx.doi.org/10.1016/j.artmed.2018.03.001] [PMID: 29559249]
[http://dx.doi.org/10.1016/j.jbi.2018.08.005] [PMID: 30142385]
[http://dx.doi.org/10.1016/j.jbi.2020.103451] [PMID: 32454243]
[http://dx.doi.org/10.2174/1381612826666200612163819] [PMID: 32532187]
[http://dx.doi.org/10.1021/ci2005548] [PMID: 23157436]
[http://dx.doi.org/10.1186/1471-2164-12-S5-S11]
[http://dx.doi.org/10.1186/s12918-017-0477-2] [PMID: 29297371]
[http://dx.doi.org/10.1186/1471-2105-12-169] [PMID: 21586169]
[http://dx.doi.org/10.1016/j.neucom.2015.08.054]
[http://dx.doi.org/10.1186/s12859-015-0774-y] [PMID: 26537615]
[http://dx.doi.org/10.1155/2020/4675395] [PMID: 32596314]
[http://dx.doi.org/10.1109/JBHI.2018.2883834] [PMID: 30507518]
[PMID: 29994681]
[http://dx.doi.org/10.1016/j.ebiom.2020.102837] [PMID: 32565027]
[http://dx.doi.org/10.1186/s12911-021-01402-3] [PMID: 33541342]
[http://dx.doi.org/10.1016/j.jbi.2014.05.013] [PMID: 24928448]
[http://dx.doi.org/10.1007/s40264-018-0688-5] [PMID: 29876834]
[http://dx.doi.org/10.1016/j.jbi.2018.09.015] [PMID: 30268842]
[PMID: 29295151]
[http://dx.doi.org/10.3390/genes10020159] [PMID: 30791472]
[http://dx.doi.org/10.1038/s41467-020-18305-y] [PMID: 32917868]
[http://dx.doi.org/10.1093/bib/bbab449] [PMID: 34718402]
[http://dx.doi.org/10.1093/bib/bbab586] [PMID: 35043189]
[http://dx.doi.org/10.1093/bib/bbab239] [PMID: 34213525]
[http://dx.doi.org/10.1093/bib/bbac126] [PMID: 35470853]
[http://dx.doi.org/10.1093/bib/bbaa267] [PMID: 33147616]
[http://dx.doi.org/10.1039/D1MO00237F] [PMID: 34610633]
[http://dx.doi.org/10.1089/cmb.2019.0063]
[http://dx.doi.org/10.3389/fchem.2019.00924] [PMID: 31998700]
[http://dx.doi.org/10.3934/mbe.2021367] [PMID: 34814256]
[http://dx.doi.org/10.3389/fphar.2019.01301] [PMID: 31780934]
[http://dx.doi.org/10.3390/cells8070705] [PMID: 31336774]
[http://dx.doi.org/10.1186/s12859-021-04406-y] [PMID: 34717542]
[http://dx.doi.org/10.1093/bib/bbab515] [PMID: 34891172]
[http://dx.doi.org/10.1109/JBHI.2020.3039502] [PMID: 33216722]
[http://dx.doi.org/10.1109/JBHI.2023.3272154] [PMID: 37163398]
[http://dx.doi.org/10.15252/msb.202010116] [PMID: 33734582]
[http://dx.doi.org/10.1109/BIBM47256.2019.8983209]
[http://dx.doi.org/10.1007/BF02289026]
[http://dx.doi.org/10.1093/bioinformatics/btaa598] [PMID: 32597948]
[http://dx.doi.org/10.1093/bioinformatics/btaa891] [PMID: 33381844]
[http://dx.doi.org/10.1109/JBHI.2020.2998906] [PMID: 32750918]
[http://dx.doi.org/10.1016/j.ymeth.2021.08.003] [PMID: 34419588]
[PMID: 34864873]
[http://dx.doi.org/10.1038/s41598-023-34438-8] [PMID: 37149692]
[http://dx.doi.org/10.1109/JBHI.2022.3233711]