Abstract
High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that feature selection must be massive, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used? And what is the optimal number of features to use? These criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.
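Issues (1) and (2) can be illustrated with a small simulation. The sketch below is not the authors' experimental setup; it assumes a toy model in which both classes are Gaussian with identity covariance and equal priors, only the first k features differ in mean, features are ranked by a t-like score, and the classifier is a nearest-mean rule. All function names and parameter values (`true_error`, `one_trial`, `n`, `d`, `k`, `delta`) are hypothetical choices made for illustration. Under these assumptions the true error of each designed classifier has a closed form, so no test set is needed.

```python
import math
import random
from statistics import mean, variance


def phi(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def true_error(m0, m1, mu0, mu1):
    """Exact error of the nearest-mean rule built from estimates m0, m1
    when the classes are N(mu0, I) and N(mu1, I) with equal priors."""
    w = [b - a for a, b in zip(m0, m1)]            # discriminant direction
    norm = math.sqrt(sum(v * v for v in w))
    if norm == 0.0:
        return 0.5                                 # degenerate rule: coin flip
    c = [0.5 * (a + b) for a, b in zip(m0, m1)]    # midpoint of the estimates
    p0 = sum(wi * (u - ci) for wi, u, ci in zip(w, mu0, c)) / norm
    p1 = sum(wi * (u - ci) for wi, u, ci in zip(w, mu1, c)) / norm
    return 0.5 * (phi(p0) + phi(-p1))


def one_trial(rng, n, d, k, delta):
    """Draw n points per class, select k of d features, and return the true
    errors for (selected set, best set). Assumed model, not the paper's."""
    mu0 = [0.0] * d
    mu1 = [delta if j < k else 0.0 for j in range(d)]  # first k informative
    # x0[j] and x1[j] hold the n training values of feature j for each class
    x0 = [[rng.gauss(mu0[j], 1.0) for _ in range(n)] for j in range(d)]
    x1 = [[rng.gauss(mu1[j], 1.0) for _ in range(n)] for j in range(d)]
    m0 = [mean(col) for col in x0]
    m1 = [mean(col) for col in x1]
    # t-like filter score: mean difference over pooled standard deviation
    score = [abs(m1[j] - m0[j])
             / math.sqrt(0.5 * (variance(x0[j]) + variance(x1[j])))
             for j in range(d)]
    selected = sorted(range(d), key=score.__getitem__)[-k:]
    best = list(range(k))                          # known only in simulation

    def err(idx):
        return true_error([m0[j] for j in idx], [m1[j] for j in idx],
                          [mu0[j] for j in idx], [mu1[j] for j in idx])

    return err(selected), err(best)


def pearson_and_slope(x, y):
    """Pearson correlation of x and y, and least-squares slope of y on x."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / math.sqrt(sxx * syy), sxy / sxx


rng = random.Random(0)
pairs = [one_trial(rng, n=15, d=100, k=5, delta=0.8) for _ in range(100)]
err_sel = [p[0] for p in pairs]
err_best = [p[1] for p in pairs]
corr, slope = pearson_and_slope(err_best, err_sel)

print(f"mean error, selected set: {mean(err_sel):.3f}")
print(f"mean error, best set:     {mean(err_best):.3f}")
print(f"correlation of the two errors: {corr:.3f}")
print(f"slope of selected-set error regressed on best-set error: {slope:.3f}")
```

Re-running the loop over a range of `k` for fixed `n`, and again for larger `n`, gives a rough way to look at issue (3), since the number of features that minimizes the mean selected-set error can then be read off for each sample size.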
Current Genomics
Title: Performance of Feature Selection Methods
Volume: 10 Issue: 6
Author(s): Edward R. Dougherty, Jianping Hua and Chao Sima
Cite this article as:
Dougherty Edward R., Hua Jianping and Sima Chao, Performance of Feature Selection Methods, Current Genomics 2009; 10 (6). https://dx.doi.org/10.2174/138920209789177629
DOI: https://dx.doi.org/10.2174/138920209789177629
Print ISSN: 1389-2029
Publisher Name: Bentham Science Publishers
Online ISSN: 1875-5488