Abstract
High-throughput biological technologies offer the promise of finding feature sets to serve as biomarkers for medical applications; however, the sheer number of potential features (genes, proteins, etc.) demands feature selection on a massive scale, far greater than that envisioned in the classical literature. This paper considers performance analysis for feature-selection algorithms from two fundamental perspectives: how does the classification accuracy achieved with a selected feature set compare to the accuracy achieved with the best feature set, and what is the optimal number of features to use? These criteria manifest themselves in several issues that need to be considered when examining the efficacy of a feature-selection algorithm: (1) the correlation between the classifier errors for the selected feature set and the theoretically best feature set; (2) the regressions of the aforementioned errors upon one another; (3) the peaking phenomenon, that is, the effect of sample size on feature selection; and (4) the analysis of feature selection in the framework of high-dimensional models corresponding to high-throughput data.
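The two perspectives named in the abstract can be made concrete with a small simulation. The sketch below is illustrative only and is not the authors' experimental protocol: it assumes a synthetic Gaussian model with 3 informative and 27 noise features, a t-statistic filter as the selection method, and a nearest-mean classifier. It treats the model-informative features as a stand-in for the "best" feature set. All function names and parameter values are hypothetical choices made for this example.

```python
import random
import statistics

def make_sample(n, deltas, rng):
    """Draw n points with alternating labels; feature j ~ N(+/- deltas[j]/2, 1)."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        sign = 1.0 if label else -1.0
        X.append([rng.gauss(sign * d / 2.0, 1.0) for d in deltas])
        y.append(label)
    return X, y

def rank_features(X, y):
    """Rank features by absolute Welch t-statistic, best first (a simple filter)."""
    scored = []
    for j in range(len(X[0])):
        a = [x[j] for x, lab in zip(X, y) if lab == 0]
        b = [x[j] for x, lab in zip(X, y) if lab == 1]
        se = (statistics.variance(a) / len(a) + statistics.variance(b) / len(b)) ** 0.5
        scored.append((abs(statistics.fmean(a) - statistics.fmean(b)) / se, j))
    return [j for _, j in sorted(scored, reverse=True)]

def nearest_mean_error(train, test, feats):
    """Design a nearest-mean classifier on `feats`; return its error rate on `test`."""
    (Xtr, ytr), (Xte, yte) = train, test
    means = []
    for c in (0, 1):
        rows = [x for x, lab in zip(Xtr, ytr) if lab == c]
        means.append([statistics.fmean([r[j] for r in rows]) for j in feats])
    wrong = 0
    for x, lab in zip(Xte, yte):
        d0 = sum((x[j] - m) ** 2 for j, m in zip(feats, means[0]))
        d1 = sum((x[j] - m) ** 2 for j, m in zip(feats, means[1]))
        wrong += (1 if d1 < d0 else 0) != lab
    return wrong / len(yte)

def pearson(u, v):
    """Pearson correlation of two equal-length sequences (nan if one is constant)."""
    mu, mv = statistics.fmean(u), statistics.fmean(v)
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sum((a - mu) ** 2 for a in u) ** 0.5
    sv = sum((b - mv) ** 2 for b in v) ** 0.5
    return cov / (su * sv) if su > 0 and sv > 0 else float("nan")

def simulate(n_train=30, n_test=500, reps=20, sizes=(1, 2, 3, 5, 10, 20, 30), seed=0):
    """Compare selected-set and 'best'-set errors, and error versus feature count."""
    rng = random.Random(seed)
    deltas = [1.0] * 3 + [0.0] * 27   # assumed model: 3 informative, 27 noise features
    best = [0, 1, 2]                  # model-informative set, stand-in for the best set
    test = make_sample(n_test, deltas, rng)
    err_sel, err_best = [], []
    curve = {d: [] for d in sizes}
    for _ in range(reps):
        train = make_sample(n_train, deltas, rng)
        ranking = rank_features(*train)
        err_sel.append(nearest_mean_error(train, test, ranking[:len(best)]))
        err_best.append(nearest_mean_error(train, test, best))
        for d in sizes:               # error as the number of selected features grows
            curve[d].append(nearest_mean_error(train, test, ranking[:d]))
    mean_curve = {d: statistics.fmean(v) for d, v in curve.items()}
    return err_sel, err_best, mean_curve

if __name__ == "__main__":
    err_sel, err_best, mean_curve = simulate()
    print("mean error, selected set:", round(statistics.fmean(err_sel), 3))
    print("mean error, 'best' set:  ", round(statistics.fmean(err_best), 3))
    print("correlation of the errors:", round(pearson(err_sel, err_best), 3))
    for d, e in mean_curve.items():
        print(f"features={d:2d}  mean error={e:.3f}")
```

The first three printed values correspond loosely to issues (1) and (2): how closely the selected-set error tracks the error of the reference set across training samples. The final loop shows error as a function of feature-set size at a fixed sample size, which is the setting in which peaking is usually discussed. Varying `n_train` would show how that curve depends on sample size.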
Current Genomics
Title: Performance of Feature Selection Methods
Volume: 10 Issue: 6
Author(s): Edward R. Dougherty, Jianping Hua and Chao Sima
Cite this article as:
Dougherty Edward R., Hua Jianping and Sima Chao, Performance of Feature Selection Methods, Current Genomics 2009; 10 (6). https://dx.doi.org/10.2174/138920209789177629
DOI: https://dx.doi.org/10.2174/138920209789177629
Print ISSN: 1389-2029
Publisher Name: Bentham Science Publisher
Online ISSN: 1875-5488