Abstract
Since the advent of high-throughput DNA sequencing technologies, the ever-increasing rate at which genomes have been published has generated new challenges notably at the level of genome annotation. Even if gene predictors and annotation softwares are more and more efficient, the ultimate validation is still in the observation of predicted gene product( s). Mass-spectrometry based proteomics provides the necessary high throughput technology to show evidences of protein presence and, from the identified sequences, confirmation or invalidation of predicted annotations. We review here different strategies used to perform a MS-based proteogenomics experiment with a bottom-up approach. We start from the strengths and weaknesses of the different database construction strategies, based on different genomic information (whole genome, ORF, cDNA, EST or RNA-Seq data), which are then used for matching mass spectra to peptides and proteins. We also review the important points to be considered for a correct statistical assessment of the peptide identifications. Finally, we provide references for tools used to map and visualize the peptide identifications back to the original genomic information.
Keywords: Proteogenomics, databases, bioinformatics, gene annotation, mass-spectrometry, proteomics.
Current Topics in Medicinal Chemistry
Title:Database Construction and Peptide Identification Strategies for Proteogenomic Studies on Sequenced Genomes
Volume: 14 Issue: 3
Author(s): Celine Hernandez, Patrice Waridel and Manfredo Quadroni
Affiliation:
Keywords: Proteogenomics, databases, bioinformatics, gene annotation, mass-spectrometry, proteomics.
Abstract: Since the advent of high-throughput DNA sequencing technologies, the ever-increasing rate at which genomes have been published has generated new challenges notably at the level of genome annotation. Even if gene predictors and annotation softwares are more and more efficient, the ultimate validation is still in the observation of predicted gene product( s). Mass-spectrometry based proteomics provides the necessary high throughput technology to show evidences of protein presence and, from the identified sequences, confirmation or invalidation of predicted annotations. We review here different strategies used to perform a MS-based proteogenomics experiment with a bottom-up approach. We start from the strengths and weaknesses of the different database construction strategies, based on different genomic information (whole genome, ORF, cDNA, EST or RNA-Seq data), which are then used for matching mass spectra to peptides and proteins. We also review the important points to be considered for a correct statistical assessment of the peptide identifications. Finally, we provide references for tools used to map and visualize the peptide identifications back to the original genomic information.
Export Options
About this article
Cite this article as:
Hernandez Celine, Waridel Patrice and Quadroni Manfredo, Database Construction and Peptide Identification Strategies for Proteogenomic Studies on Sequenced Genomes, Current Topics in Medicinal Chemistry 2014; 14 (3) . https://dx.doi.org/10.2174/1568026613666131204105652
DOI https://dx.doi.org/10.2174/1568026613666131204105652 |
Print ISSN 1568-0266 |
Publisher Name Bentham Science Publisher |
Online ISSN 1873-4294 |
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
- Announcements
Related Articles
-
Non-adenosine Nucleoside Inosine, Guanosine and Uridine as Promising Antiepileptic Drugs: a Summary of Current Literature
Mini-Reviews in Medicinal Chemistry Gold Nanoparticles; Potential Nanotheranostic Agent in Breast Cancer: A Comprehensive Review with Systematic Search Strategy
Current Drug Metabolism Endothelial Cells in Inflammation and Angiogenesis
Current Drug Targets - Inflammation & Allergy Ameliorative Effect of α-Tocopherol and Selenium on Effects of Malathion on Plasmatic Biochemical Indices and Lesions in the Liver of Rats
Current Pharmaceutical Analysis CXCR1/2 Chemokine Network Regulates Melanoma Resistance to Chemotherapies Mediated by NF-κB
Current Molecular Medicine FDG-PET/CT Predicts Outcome in Oropharingeal Carcinoma Patients Undergoing Intensity Modulated Radiation Therapy with Dose Escalation to FDG-avid Tumour Volumes
Current Radiopharmaceuticals Advances in the use of MOFs for Cancer Diagnosis and Treatment: An Overview
Current Pharmaceutical Design Differentiated Thyroid Carcinoma and Intestinal Polyposis Syndromes
Endocrine, Metabolic & Immune Disorders - Drug Targets Renal Nerve Ablation for Hypertensive Patients with Chronic Kidney Disease
Current Vascular Pharmacology Aptamers: Potential Applications to Pancreatic Cancer Therapy
Anti-Cancer Agents in Medicinal Chemistry Revisiting Non-Cancer Drugs for Cancer Therapy
Current Topics in Medicinal Chemistry A Review of Natural and Synthetic Antioxidants Important for Health and Longevity
Current Medicinal Chemistry PET Tracers Based on 86Y
Current Radiopharmaceuticals Meet Our Associate Editor:
Current Molecular Medicine Heparin Derivatives as Angiogenesis Inhibitors
Current Pharmaceutical Design Breast Cancer: Biological Characteristics in Postmenopausal Type 2 Diabetic Women. Identification of Therapeutic Targets
Current Drug Targets - Immune, Endocrine & Metabolic Disorders Chlorella vulgaris Induces Apoptosis of Human Non-Small Cell Lung Carcinoma (NSCLC) Cells
Medicinal Chemistry PAF Receptor and Tumor Growth
Current Drug Targets Targeting Different Signaling Pathways with Antisense Oligonucleotides Combination for Cancer Therapy
Current Pharmaceutical Design ANTI-ADHESION Evolves To a Promising Therapeutic Concept in Oncology
Current Medicinal Chemistry