Abstract
Virtual screening is an indispensable tool to cope with the massive amount of data being tossed by the high throughput omics technologies. With the objective of enhancing the automation capability of virtual screening process a robust portal termed MegaMiner has been built using the cloud computing platform wherein the user submits a text query and directly accesses the proposed lead molecules along with their drug-like, lead-like and docking scores. Textual chemical structural data representation is fraught with ambiguity in the absence of a global identifier. We have used a combination of statistical models, chemical dictionary and regular expression for building a disease specific dictionary. To demonstrate the effectiveness of this approach, a case study on malaria has been carried out in the present work. MegaMiner offered superior results compared to other text mining search engines, as established by F score analysis. A single query term 'malaria' in the portlet led to retrieval of related PubMed records, protein classes, drug classes and 8000 scaffolds which were internally processed and filtered to suggest new molecules as potential anti-malarials. The results obtained were validated by docking the virtual molecules into relevant protein targets. It is hoped that MegaMiner will serve as an indispensable tool for not only identifying hidden relationships between various biological and chemical entities but also for building better corpus and ontologies.
Keywords: Chemoinformatics, cloud computing, malaria, text mining, virtual screening.
Combinatorial Chemistry & High Throughput Screening
Title:MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment
Volume: 18 Issue: 6
Author(s): Muthukumarasamy Karthikeyan, Yogesh Pandit, Deepak Pandit and Renu Vyas
Affiliation:
Keywords: Chemoinformatics, cloud computing, malaria, text mining, virtual screening.
Abstract: Virtual screening is an indispensable tool to cope with the massive amount of data being tossed by the high throughput omics technologies. With the objective of enhancing the automation capability of virtual screening process a robust portal termed MegaMiner has been built using the cloud computing platform wherein the user submits a text query and directly accesses the proposed lead molecules along with their drug-like, lead-like and docking scores. Textual chemical structural data representation is fraught with ambiguity in the absence of a global identifier. We have used a combination of statistical models, chemical dictionary and regular expression for building a disease specific dictionary. To demonstrate the effectiveness of this approach, a case study on malaria has been carried out in the present work. MegaMiner offered superior results compared to other text mining search engines, as established by F score analysis. A single query term 'malaria' in the portlet led to retrieval of related PubMed records, protein classes, drug classes and 8000 scaffolds which were internally processed and filtered to suggest new molecules as potential anti-malarials. The results obtained were validated by docking the virtual molecules into relevant protein targets. It is hoped that MegaMiner will serve as an indispensable tool for not only identifying hidden relationships between various biological and chemical entities but also for building better corpus and ontologies.
Export Options
About this article
Cite this article as:
Karthikeyan Muthukumarasamy, Pandit Yogesh, Pandit Deepak and Vyas Renu, MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment, Combinatorial Chemistry & High Throughput Screening 2015; 18 (6) . https://dx.doi.org/10.2174/1386207318666150703113525
DOI https://dx.doi.org/10.2174/1386207318666150703113525 |
Print ISSN 1386-2073 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5402 |

- Author Guidelines
- Bentham Author Support Services (BASS)
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
Related Articles
-
Epidemiology and Management of Infectious Complications in Contemporary Management of Chronic Leukemias
Infectious Disorders - Drug Targets Biomarkers for Early Detection of Liver Cancer: Focus on Clinical Evaluation
Protein & Peptide Letters Production of Recombinant Oxytocin Through Sulfitolysis of Inteincontaining Fusion Protein
Protein & Peptide Letters Resveratrol and Stroke: from Chemistry to Medicine
Current Neurovascular Research Functional Foods: Salient Features and Clinical Applications
Current Drug Targets - Immune, Endocrine & Metabolic Disorders The Use of Erythropoietin and its Derivatives to Treat Spinal Cord Injury
Mini-Reviews in Medicinal Chemistry Intramammary Application of Non-Methylated-CpG Oligodeoxynucleotides (CpG) Inhibits both Local and Systemic Mammary Carcinogenesis in Female BALB/c Her-2/neu Transgenic Mice
Current Cancer Drug Targets A Review of Calcium Pyrophosphate Deposition (CPPD)
Current Medical Imaging Inflammatory Related Cardiovascular Diseases: From Molecular Mechanisms to Therapeutic Targets
Current Pharmaceutical Design CD4+CD25+ T Regulatory Cells and TGF-β in Mucosal Immune System: The Good and the Bad
Current Medicinal Chemistry Cytotoxic Effects of Glass Ionomer Cements on Human Dental Pulp Stem Cells Correlate with Fluoride Release
Medicinal Chemistry Inflammation as a Therapeutic Target in Acute Ischemic Stroke Treatment
Current Topics in Medicinal Chemistry Role of Inflammation and Tumor Microenvironment in the Development of Gastrointestinal Cancers: What Induced Pluripotent Stem Cells Can Do?
Current Stem Cell Research & Therapy Non-Canonical Peptides Bound to MHC
Current Pharmaceutical Design Rational Design and Intramolecular Cyclization of Hotspot Peptide Segments at YAP–TEAD4 Complex Interface
Protein & Peptide Letters Design, Synthesis and Biological Screening of Some Pyridinylpyrazole and Pyridinylisoxazole Derivatives as Potential Anti-inflammatory, Analgesic, Antipyretic and Antimicrobial Agents
Medicinal Chemistry Optimization of Microemulgel for Tizanidine Hydrochloride
Anti-Inflammatory & Anti-Allergy Agents in Medicinal Chemistry Pathomechanisms of Myocardial Dysfunction in Sepsis
Endocrine, Metabolic & Immune Disorders - Drug Targets Inflammatory Bowel Disease in Migrant Populations: Should we Look Even Further Back?
Current Drug Targets Correlation Between Circulating Adhesion Molecules and Resistin Levels in Hypertensive Type-2 Diabetic Patients
Inflammation & Allergy - Drug Targets (Discontinued)