Abstract
Background: Predicting drug-target interaction (DTI) plays a crucial role in drug research and development. More and more researchers pay attention to the problem of developing more powerful prediction methods. Traditional DTI prediction methods are basically realized by biochemical experiments, which are time-consuming, risky, and costly. Nowadays, DTI prediction is often solved by using a single information source and a single model, or by combining some models, but the prediction results are still not accurate enough.
Objective: The study aimed to utilize existing data and machine learning models to integrate heterogeneous data sources and different models, further improving the accuracy of DTI prediction.
Methods: This paper has proposed a novel prediction method based on reinforcement learning, called QLDTI (predicting drug-target interaction based on Q-learning), which can be mainly divided into two parts: data fusion and model fusion. Firstly, it fuses the drug and target similarity matrices calculated by different calculation methods through Q-learning. Secondly, the new similarity matrices are inputted into five models, NRLMF, CMF, BLM-NII, NetLapRLS, and WNN-GIP, for further training. Then, all sub-model weights are continuously optimized again by Q-learning, which can be used to linearly weight all sub-model prediction results to output the final prediction result.
Results: QLDTI achieved AUC accuracy of 99.04%, 99.12%, 98.28%, and 98.35% on E, NR, IC, and GPCR datasets, respectively. Compared to the existing five models NRLMF, CMF, BLM-NII, NetLapRLS, and WNN-GIP, the QLDTI method has achieved better results on four benchmark datasets of E, NR, IC, and GPCR.
Conclusion: Data fusion and model fusion have been proven effective for DTI prediction, further improving the prediction accuracy of DTI.
Graphical Abstract
[http://dx.doi.org/10.1016/j.jbi.2019.103159] [PMID: 30926470]
[http://dx.doi.org/10.3390/molecules23092208] [PMID: 30200333]
[http://dx.doi.org/10.1038/nbt1284] [PMID: 17287757]
[http://dx.doi.org/10.2174/157341208783497597]
[http://dx.doi.org/10.1038/s41586-021-03819-2] [PMID: 34265844]
[http://dx.doi.org/10.1007/978-1-62703-107-3_9]
[http://dx.doi.org/10.1517/17425255.2014.950222] [PMID: 25112457]
[http://dx.doi.org/10.1093/bioinformatics/bts412] [PMID: 22962471]
[http://dx.doi.org/10.1016/0022-2836(81)90087-5] [PMID: 7265238]
[http://dx.doi.org/10.1093/bib/bbaa267] [PMID: 33147616]
[http://dx.doi.org/10.1007/s00521-019-04569-z]
[http://dx.doi.org/10.1109/TCBB.2021.3135978]
[http://dx.doi.org/10.3390/app11188382]
[http://dx.doi.org/10.1016/j.ymeth.2021.10.007] [PMID: 34737033]
[http://dx.doi.org/10.1093/bioinformatics/btx731] [PMID: 29186331]
[http://dx.doi.org/10.1039/D0MO00062K] [PMID: 33084702]
[http://dx.doi.org/10.1371/journal.pcbi.1004760] [PMID: 26872142]
[http://dx.doi.org/10.1093/bioinformatics/bts670] [PMID: 23162055]
[http://dx.doi.org/10.1371/journal.pone.0066952] [PMID: 23840562]
[http://dx.doi.org/10.1093/nar/gkt1068] [PMID: 24203711]
[http://dx.doi.org/10.1093/nar/gkj102] [PMID: 16381885]
[PMID: 23203881]
[http://dx.doi.org/10.1093/nar/gkr912] [PMID: 22067455]
[http://dx.doi.org/10.1109/JBHI.2015.2513200] [PMID: 26731781]
[http://dx.doi.org/10.1093/nar/gkr777] [PMID: 21948594]