Abstract
Since the advent of high-throughput DNA sequencing technologies, the ever-increasing rate at which genomes have been published has generated new challenges notably at the level of genome annotation. Even if gene predictors and annotation softwares are more and more efficient, the ultimate validation is still in the observation of predicted gene product( s). Mass-spectrometry based proteomics provides the necessary high throughput technology to show evidences of protein presence and, from the identified sequences, confirmation or invalidation of predicted annotations. We review here different strategies used to perform a MS-based proteogenomics experiment with a bottom-up approach. We start from the strengths and weaknesses of the different database construction strategies, based on different genomic information (whole genome, ORF, cDNA, EST or RNA-Seq data), which are then used for matching mass spectra to peptides and proteins. We also review the important points to be considered for a correct statistical assessment of the peptide identifications. Finally, we provide references for tools used to map and visualize the peptide identifications back to the original genomic information.
Keywords: Proteogenomics, databases, bioinformatics, gene annotation, mass-spectrometry, proteomics.
Current Topics in Medicinal Chemistry
Title:Database Construction and Peptide Identification Strategies for Proteogenomic Studies on Sequenced Genomes
Volume: 14 Issue: 3
Author(s): Celine Hernandez, Patrice Waridel and Manfredo Quadroni
Affiliation:
Keywords: Proteogenomics, databases, bioinformatics, gene annotation, mass-spectrometry, proteomics.
Abstract: Since the advent of high-throughput DNA sequencing technologies, the ever-increasing rate at which genomes have been published has generated new challenges notably at the level of genome annotation. Even if gene predictors and annotation softwares are more and more efficient, the ultimate validation is still in the observation of predicted gene product( s). Mass-spectrometry based proteomics provides the necessary high throughput technology to show evidences of protein presence and, from the identified sequences, confirmation or invalidation of predicted annotations. We review here different strategies used to perform a MS-based proteogenomics experiment with a bottom-up approach. We start from the strengths and weaknesses of the different database construction strategies, based on different genomic information (whole genome, ORF, cDNA, EST or RNA-Seq data), which are then used for matching mass spectra to peptides and proteins. We also review the important points to be considered for a correct statistical assessment of the peptide identifications. Finally, we provide references for tools used to map and visualize the peptide identifications back to the original genomic information.
Export Options
About this article
Cite this article as:
Hernandez Celine, Waridel Patrice and Quadroni Manfredo, Database Construction and Peptide Identification Strategies for Proteogenomic Studies on Sequenced Genomes, Current Topics in Medicinal Chemistry 2014; 14 (3) . https://dx.doi.org/10.2174/1568026613666131204105652
DOI https://dx.doi.org/10.2174/1568026613666131204105652 |
Print ISSN 1568-0266 |
Publisher Name Bentham Science Publisher |
Online ISSN 1873-4294 |
![](/images/wayfinder.jpg)
- Author Guidelines
- Bentham Author Support Services (BASS)
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
- Announcements
Related Articles
-
Pentapeptides as Minimal Functional Units in Cell Biology and Immunology
Current Protein & Peptide Science FTY720: A Most Promising Immunosuppressant Modulating Immune Cell Functions
Mini-Reviews in Medicinal Chemistry The Role of the Metabolism of Anticancer Drugs in Their Induced-Cardiotoxicity
Current Drug Metabolism Oxidative/Nitrosative Stress and Immuno-inflammatory Pathways in Depression: Treatment Implications
Current Pharmaceutical Design Matrix Metalloproteinase Inhibitors: A Review on Bioanalytical Methods, Pharmacokinetics and Metabolism
Current Drug Metabolism New Insight into P-Glycoprotein as a Drug Target
Anti-Cancer Agents in Medicinal Chemistry Cancer Vaccines in Phase II/III Clinical Trials: State of the Art and Future Perspectives
Current Cancer Drug Targets Metformin for Prevention and Treatment of Colon Cancer: A Reappraisal of Experimental and Clinical Data
Current Drug Targets Beta-Caryophyllene Suppresses Ovarian Cancer Proliferation by Inducing Cell Cycle Arrest and Apoptosis
Anti-Cancer Agents in Medicinal Chemistry Recent Advances in the Synthesis of Biologically Relevant Selenium-containing 5-Membered Heterocycles
Current Organic Chemistry Future Prospects for Targeted Alpha Therapy
Current Radiopharmaceuticals Dual-Targeted Molecular Probes for Cancer Imaging
Current Pharmaceutical Biotechnology Proteomics of Airway Inflammatory Cells in Asthma Research
Current Proteomics Structures of TGF-β Receptor Complexes: Implications for Function and Therapeutic Intervention Using Ligand Traps
Current Pharmaceutical Biotechnology Interaction Between Estrogen Receptor Alpha and Insulin/IGF Signaling in Breast Cancer
Current Cancer Drug Targets Protein Degradation Pathways after Brain Ischemia
Current Drug Targets Recent Trends in Nanotechnology-Based Drugs and Formulations for Targeted Therapeutic Delivery
Recent Patents on Inflammation & Allergy Drug Discovery Circulating Aminopeptidase Activities in Men and Women with Essential Hypertension
Current Medicinal Chemistry Novel Protein Targeted Therapy of Metastatic Melanoma
Current Pharmaceutical Design Bioinformatic Application in Proteomic Research on Biomarker Discovery and Drug Target Validation
Current Bioinformatics