Abstract
High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.
Current Genomics
Title: Performance of Feature Selection Methods
Volume: 10 Issue: 6
Author(s): Edward R. Dougherty, Jianping Hua and Chao Sima
Affiliation:
Abstract: High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) means that there needs to be massive feature selection, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: How does the classification accuracy achieved with a selected feature set compare to the accuracy when the best feature set is used and what is the optimal number of features that should be used? The criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.
Export Options
About this article
Cite this article as:
Dougherty R. Edward, Hua Jianping and Sima Chao, Performance of Feature Selection Methods, Current Genomics 2009; 10 (6) . https://dx.doi.org/10.2174/138920209789177629
DOI https://dx.doi.org/10.2174/138920209789177629 |
Print ISSN 1389-2029 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5488 |
Call for Papers in Thematic Issues
Current Genomics in Cardiovascular Research
Cardiovascular diseases are the main cause of death in the world, in recent years we have had important advances in the interaction between cardiovascular disease and genomics. In this Research Topic, we intend for researchers to present their results with a focus on basic, translational and clinical investigations associated with ...read more
Deep learning in Single Cell Analysis
The field of biology is undergoing a revolution in our ability to study individual cells at the molecular level, and to integrate data from multiple sources and modalities. This has been made possible by advances in technologies for single-cell sequencing, multi-omics profiling, spatial transcriptomics, and high-throughput imaging, as well as ...read more
New insights on Pediatric Tumors and Associated Cancer Predisposition Syndromes
Because of the broad spectrum of children cancer susceptibility, the diagnosis of cancer risk syndromes in children is rarely used in direct cancer treatment. The field of pediatric cancer genetics and genomics will only continue to expand as a result of increasing use of genetic testing tools. It's possible that ...read more
Related Journals
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
- Announcements
Related Articles
-
Selenium Compounds and Apoptotic Modulation: A New Perspective in Cancer Therapy
Mini-Reviews in Medicinal Chemistry Heme Oxygenase -1 Gene Therapy: Recent Advances and Therapeutic Applications
Current Gene Therapy Bioactive Heterocyclic Compounds as Potential Therapeutics in the Treatment of Gliomas: A Review
Anti-Cancer Agents in Medicinal Chemistry Retinal Ganglion Cell Gene Therapy and Visual System Repair
Current Gene Therapy Targeting Cytotoxic Conjugates of Somatostatin, Luteinizing Hormone- Releasing Hormone and Bombesin to Cancers Expressing Their Receptors: A “Smarter” Chemotherapy
Current Pharmaceutical Design Synthesis and In Vitro Evaluation of Novel 1,2,3,4-Tetrahydroisoquinoline Derivatives as Potent Antiglioma Agents
Anti-Cancer Agents in Medicinal Chemistry Implications of Epigenetic Mechanisms and their Targets in Cerebral Ischemia Models
Current Neuropharmacology Potential Usage of ING Family Members in Cancer Diagnostics and Molecular Therapy
Current Drug Targets Engineered Exosomes: A Promising Drug Delivery Strategy for Brain Diseases
Current Medicinal Chemistry Recent Advances in Oncological Submissions of Dendrimer
Current Pharmaceutical Design Functional Role of miR-34 Family in Human Cancer
Current Drug Targets Evaluation of Venom as a Promising Tool for Drug Discovery: Focusing on Neurological Disorders
Venoms and Toxins Highly Organized Nanostructures for Brain Drug Delivery - New Hope or Just a Fad?
CNS & Neurological Disorders - Drug Targets An Investigative Approach to Treatment Modalities for Squamous Cell Carcinoma of Skin
Current Drug Delivery Emerging Treatments in Acute Lymphoblastic Leukemia
Current Cancer Drug Targets Recent Progress in the Development of ATP-Competitive and Allosteric Akt Kinase Inhibitors
Current Topics in Medicinal Chemistry Derivatives of Procaspase-Activating Compound 1 (PAC-1) and their Anticancer Activities
Current Medicinal Chemistry Epidermal Growth Factor Receptor (EGFR) Tyrosine Kinase Inhibitors from the Natural Origin: A Recent Perspective
Anti-Cancer Agents in Medicinal Chemistry Design and Synthesis of Tetrahydroisoquinoline Derivatives as Anti-Angiogenesis and Anti-Cancer Agents
Anti-Cancer Agents in Medicinal Chemistry Palmitoylethanolamide Regulates Production of Pro-Angiogenic Mediators in a Model of β Amyloid-Induced Astrogliosis <i>In Vitro</i>
CNS & Neurological Disorders - Drug Targets