Abstract
Virtual screening is an indispensable tool for coping with the massive amount of data generated by high-throughput omics technologies. To enhance the automation of the virtual screening process, a robust portal termed MegaMiner has been built on a cloud computing platform, wherein the user submits a text query and directly accesses the proposed lead molecules along with their drug-like, lead-like and docking scores. Textual representation of chemical structure data is fraught with ambiguity in the absence of a global identifier. We have used a combination of statistical models, a chemical dictionary and regular expressions to build a disease-specific dictionary. To demonstrate the effectiveness of this approach, a case study on malaria was carried out in the present work. MegaMiner offered superior results compared to other text mining search engines, as established by F-score analysis. A single query term, 'malaria', in the portlet led to the retrieval of related PubMed records, protein classes, drug classes and 8000 scaffolds, which were internally processed and filtered to suggest new molecules as potential anti-malarials. The results obtained were validated by docking the virtual molecules into relevant protein targets. It is hoped that MegaMiner will serve as an indispensable tool not only for identifying hidden relationships between various biological and chemical entities but also for building better corpora and ontologies.
Keywords: Chemoinformatics, cloud computing, malaria, text mining, virtual screening.
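The abstract mentions combining a chemical dictionary with regular expressions and evaluating retrieval by F score. The sketch below is only an illustration of that general idea, not the MegaMiner implementation: the dictionary entries, the suffix pattern, and the function names `tag_chemicals` and `f_score` are all hypothetical, and the statistical-model component is omitted entirely.

```python
import re

# Hypothetical toy dictionary; the actual MegaMiner dictionary is not described in the abstract.
CHEMICAL_DICTIONARY = {"chloroquine", "artemisinin", "quinine"}

# Illustrative pattern (an assumption): words ending in a few drug-name suffixes.
SUFFIX_PATTERN = re.compile(r"\b[a-z]+(?:quine|misinin|mycin)\b")


def tag_chemicals(text):
    """Return candidate chemical names found by dictionary lookup or regex match."""
    lowered = text.lower()
    tokens = set(re.findall(r"[a-z]+", lowered))
    dictionary_hits = tokens & CHEMICAL_DICTIONARY
    regex_hits = set(SUFFIX_PATTERN.findall(lowered))
    return dictionary_hits | regex_hits


def f_score(predicted, gold):
    """Balanced F score (harmonic mean of precision and recall) over two sets."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    sentence = "Chloroquine and mefloquine resistance; artemisinin and doxycycline are used."
    gold_entities = {"chloroquine", "mefloquine", "artemisinin", "doxycycline"}
    found = tag_chemicals(sentence)
    print(sorted(found), round(f_score(found, gold_entities), 3))
```

In this toy example the tagger misses one gold entity, so recall falls below precision and the F score reflects both. A real evaluation would use an annotated corpus rather than a single hand-labelled sentence.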
Combinatorial Chemistry & High Throughput Screening
Title: MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment
Volume: 18 Issue: 6
Author(s): Muthukumarasamy Karthikeyan, Yogesh Pandit, Deepak Pandit and Renu Vyas
Cite this article as:
Karthikeyan Muthukumarasamy, Pandit Yogesh, Pandit Deepak and Vyas Renu, MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment, Combinatorial Chemistry & High Throughput Screening 2015; 18(6). https://dx.doi.org/10.2174/1386207318666150703113525
DOI: https://dx.doi.org/10.2174/1386207318666150703113525
Print ISSN: 1386-2073
Publisher Name: Bentham Science Publisher
Online ISSN: 1875-5402
