Abstract
High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.
Current Genomics
Title: Performance of Feature Selection Methods
Volume: 10 Issue: 6
Author(s): Edward R. Dougherty, Jianping Hua and Chao Sima
Affiliation:
Abstract: High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.
Export Options
About this article
Cite this article as:
Dougherty R. Edward, Hua Jianping and Sima Chao, Performance of Feature Selection Methods, Current Genomics 2009; 10 (6) . https://dx.doi.org/10.2174/138920209789177629
DOI https://dx.doi.org/10.2174/138920209789177629 |
Print ISSN 1389-2029 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5488 |
Call for Papers in Thematic Issues
Current Genomics in Cardiovascular Research
Cardiovascular diseases are the main cause of death in the world, in recent years we have had important advances in the interaction between cardiovascular disease and genomics. In this Research Topic, we intend for researchers to present their results with a focus on basic, translational and clinical investigations associated with ...read more
Deep learning in Single Cell Analysis
The field of biology is undergoing a revolution in our ability to study individual cells at the molecular level, and to integrate data from multiple sources and modalities. This has been made possible by advances in technologies for single-cell sequencing, multi-omics profiling, spatial transcriptomics, and high-throughput imaging, as well as ...read more
New insights on Pediatric Tumors and Associated Cancer Predisposition Syndromes
Because of the broad spectrum of children cancer susceptibility, the diagnosis of cancer risk syndromes in children is rarely used in direct cancer treatment. The field of pediatric cancer genetics and genomics will only continue to expand as a result of increasing use of genetic testing tools. It's possible that ...read more
Related Journals
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
- Announcements
Related Articles
-
Evidence of PKC Binding and Translocation to Explain the Anticancer Mechanism of Chlorogenic Acid in Breast Cancer Cells
Current Molecular Medicine Role of ATP-Binding Cassette Transporter Proteins in CNS Tumors: Resistance- Based Perspectives and Clinical Updates
Current Pharmaceutical Design Curcumin Nanomedicine: A Road to Cancer Therapeutics
Current Pharmaceutical Design The Dual Role of Nitric Oxide in Glioma
Current Pharmaceutical Design Noscapine and its Analogs as Chemotherapeutic Agent: Current updates
Current Topics in Medicinal Chemistry Predictive Efficacy Biomarkers of Programmed Cell Death 1/Programmed Cell Death 1 Ligand Blockade Therapy
Recent Patents on Anti-Cancer Drug Discovery Polyethylenimine as a Promising Vector for Targeted siRNA Delivery
Current Clinical Pharmacology Hypomethylation and Activation of Syncytin-1 Gene in Endometriotic Tissue
Current Pharmaceutical Design Sirtuins Family- Recent Development as a Drug Target for Aging, Metabolism, and Age Related Diseases
Current Drug Targets H+-myo-Inositol Transporter SLC2A13 as a Potential Marker for Cancer Stem Cells in an Oral Squamous Cell Carcinoma
Current Cancer Drug Targets Apoptosis is a Critical Cellular Event in Cancer Chemoprevention and Chemotherapy by Selenium Compounds
Current Cancer Drug Targets Emerging Role of Circular RNAs in Kidney Diseases in Nephrology
Current Drug Targets Sonic Hedgehog Pathway as a Target for Therapy in Angiogenesis-Related Diseases
Current Signal Transduction Therapy Human- and Virus-Encoded microRNAs as Potential Targets of Antiviral Therapy
Mini-Reviews in Medicinal Chemistry Effectivity of Long Antigen Exposition Dendritic Cell Therapy (LANEXDC<sup>®</sup>) in the Palliative Treatment of Pancreatic Cancer
Current Medicinal Chemistry Smart Mesoporous Silica Nanocarriers for Antitumoral Therapy
Current Topics in Medicinal Chemistry The Emerging Role of Poly(ADP-Ribose) Polymerase Inhibitors in Cancer Treatment
Current Drug Targets The Role of Shcbp1 in Signaling and Disease
Current Cancer Drug Targets TGF-Beta Type I Receptor (Alk5) Kinase Inhibitors in Oncology
Current Pharmaceutical Biotechnology Antiangiogenesis and Radiotherapy: What Is the Role of Combined Modality Treatment?
Current Medicinal Chemistry - Anti-Cancer Agents