Abstract
Introduction: Drug repositioning aims to identify new therapeutic uses for approved drugs and for abandoned compounds that have already been shown to be safe. This trend is changing the landscape of drug development and establishing repositioning as a model for developing new drugs. Over the past decade, machine learning methods have been applied to predict compound-protein binding affinity, and deep learning has recently become prominent, achieving strong performance. In these models, however, compounds are usually represented in a simple way, for example by molecular fingerprints or a single SMILES string.
Methods: In this study, we improve on previous work by proposing a novel representation, named SMILES#, that re-encodes the SMILES string. This representation takes the properties of compounds into account and achieves superior performance. We then propose a deep learning model that combines recurrent neural networks with a convolutional neural network and an attention mechanism, using both unlabeled and labeled data to jointly encode molecules and predict binding affinity.
Results: Experimental results show that SMILES# with compound properties effectively improves the accuracy of the model and reduces the root-mean-square error (RMSE) on most data sets.
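For readers unfamiliar with the error metric reported above, the following is a minimal sketch of how a root-mean-square error between predicted and measured affinities can be computed. The function name `rmse` and the sample values are assumptions made for illustration only; they are not taken from the study and do not reproduce its evaluation pipeline.

```python
import math


def rmse(predicted, measured):
    """Root-mean-square error between two equal-length sequences of numbers."""
    if len(predicted) != len(measured):
        raise ValueError("sequences must have the same length")
    if not predicted:
        raise ValueError("sequences must be non-empty")
    # Square each residual, average the squares, then take the square root.
    squared = [(p - m) ** 2 for p, m in zip(predicted, measured)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical affinity values (e.g. on a pIC50 scale), invented for illustration.
measured = [6.0, 7.5, 5.0, 8.0]
predicted = [6.5, 7.0, 5.5, 7.5]
print(rmse(predicted, measured))  # prints 0.5
```

A lower value indicates that predictions lie closer to the measured affinities on average, with large individual errors penalized more heavily than small ones.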
Conclusion: We applied the method to both related and unrelated compounds for the same target, and the experimental results demonstrate its effectiveness.
Keywords: Deep learning, drug repositioning, drug-target interactions, IC50 value, SMILES string, compound properties