Abstract
Background: There has been a growing interest in discovering a viable drug for the new coronavirus (SARS-CoV-2) since the beginning of the pandemic. Protein-ligand interaction studies are a crucial step in the drug discovery process, as it helps us narrow the search space for potential ligands with high drug-likeness. Derivatives of popular drugs like Remdesivir generated through tools employing evolutionary algorithms are usually considered potential candidates. However, screening promising molecules from such a large search space is difficult. In a conventional screening process, for each ligand-target pair, there are time-consuming interaction studies that use docking simulations before downstream tasks like thermodynamic, kinetic, and electrostatic-potential evaluation.
Objective: This work aims to build a model based on deep learning applied over the graph structure of the molecules to accelerate the screening process for novel potential candidates for SARS-CoV-2 by predicting the binding energy of the protein-ligand complex.
Methods: In this work, ‘Graph Convolutional Capsule Regression’ (GCCR), a model which uses Capsule Neural Networks (CapsNet) and Graph Convolutional Networks (GCN) to predict the binding energy of a protein-ligand complex is being proposed. The model’s predictions were further validated with kinetic and free energy studies like Molecular Dynamics (MD) for kinetic stability and MM/GBSA analysis for free energy calculations.
Results: The GCCR showed an RMSE value of 0.0978 for 81.3% of the concordance index. The RMSE of GCCR converged around the iteration of just 50 epochs scoring a lower RMSE than GCN and GAT. When training with Davis Dataset, GCCR gave an RMSE score of 0.3806 with a CI score of 87.5%.
Conclusion: The proposed GCCR model shows great potential in improving the screening process based on binding affinity and outperforms baseline machine learning models like DeepDTA, KronRLS, Sim- Boost, and other Graph Neural Networks (GNN) based models like Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT).
Graphical Abstract
[http://dx.doi.org/10.1186/s40779-020-00240-0]
[http://dx.doi.org/10.1038/s41586-020-2368-8] [PMID: 32438371]
[http://dx.doi.org/10.26434/chemrxiv.12315437.v1]
[http://dx.doi.org/10.1039/D1RA01603B] [PMID: 35423848]
[http://dx.doi.org/10.1093/bib/bbu010] [PMID: 24723570]
[http://dx.doi.org/10.1186/s13321-017-0209-z] [PMID: 29086119]
[http://dx.doi.org/10.1093/bioinformatics/bty593] [PMID: 30423097]
[http://dx.doi.org/10.1109/CAMSAP45676.2019.9022646]
[http://dx.doi.org/10.1016/j.aiopen.2021.01.001]
[http://dx.doi.org/10.1186/s13321-020-00460-5] [PMID: 33431035]
[http://dx.doi.org/10.1109/TNN.2008.2005605] [PMID: 19068426]
[http://dx.doi.org/10.48550/arXiv.1609.02907]
[http://dx.doi.org/10.48550/arXiv.1312.6203]
[http://dx.doi.org/10.1007/978-3-642-21735-7_6]
[http://dx.doi.org/10.48550/arXiv.2007.06225]
[http://dx.doi.org/10.1038/nbt.1990] [PMID: 22037378]
[http://dx.doi.org/10.1039/D0RA10458B] [PMID: 35423778]
[http://dx.doi.org/10.1186/s43094-020-00171-6] [PMID: 33457429]
[http://dx.doi.org/10.1080/07391102.2020.1862707] [PMID: 33345726]
[http://dx.doi.org/10.1021/ci500588j] [PMID: 25558886]
[http://dx.doi.org/10.1007/978-1-59745-177-2]
[http://dx.doi.org/10.1021/acs.jcim.8b00312] [PMID: 29989806]
[http://dx.doi.org/10.3390/app9214620]
[http://dx.doi.org/10.1016/0022-2836(81)90087-5] [PMID: 7265238]
[http://dx.doi.org/10.1038/s41598-022-10418-2] [PMID: 35538084]
[http://dx.doi.org/10.1093/biomet/92.4.965]