Abstract
In this paper, we propose a strategy to predict the subcellular locations of proteins by combining various feature selection methods. Firstly, proteins are coded by amino-acid composition and physicochemical properties, then these features are arranged by Minimum Redundancy Maximum Relevance method and further filtered by feature selection procedure. Nearest Neighbor Algorithm is used as a prediction model to predict the protein subcellular locations, and gains a correct prediction rate of 70.63%, evaluated by Jackknife cross-validation. Results of feature selection also enable us to identify the most important protein properties. The prediction software is available for public access on the website http://chemdata.shu.edu.cn/sub22/, which may play a important complementary role to a series of web-server predictors summarized recently in a review by Chou and Shen (Chou, K.C., Shen, H.B. Natural Science, 2009, 2, 63-92, http://www.scirp.org/journal/NS/).
Keywords: Subcellular location of proteins, Minimum Redundancy Maximum Relevance, Feature Selection, Nearest Neighbor Algorithm, Jackknife cross-validation test
Protein & Peptide Letters
Title: Prediction of Protein Subcellular Locations with Feature Selection and Analysis
Volume: 17 Issue: 4
Author(s): Yudong Cai, Jianfeng He, Xinlei Li, Kaiyan Feng, Lin Lu, Kairui Feng, Xiangyin Kong and Wencong Lu
Affiliation:
Keywords: Subcellular location of proteins, Minimum Redundancy Maximum Relevance, Feature Selection, Nearest Neighbor Algorithm, Jackknife cross-validation test
Abstract: In this paper, we propose a strategy to predict the subcellular locations of proteins by combining various feature selection methods. Firstly, proteins are coded by amino-acid composition and physicochemical properties, then these features are arranged by Minimum Redundancy Maximum Relevance method and further filtered by feature selection procedure. Nearest Neighbor Algorithm is used as a prediction model to predict the protein subcellular locations, and gains a correct prediction rate of 70.63%, evaluated by Jackknife cross-validation. Results of feature selection also enable us to identify the most important protein properties. The prediction software is available for public access on the website http://chemdata.shu.edu.cn/sub22/, which may play a important complementary role to a series of web-server predictors summarized recently in a review by Chou and Shen (Chou, K.C., Shen, H.B. Natural Science, 2009, 2, 63-92, http://www.scirp.org/journal/NS/).
Export Options
About this article
Cite this article as:
Cai Yudong, He Jianfeng, Li Xinlei, Feng Kaiyan, Lu Lin, Feng Kairui, Kong Xiangyin and Lu Wencong, Prediction of Protein Subcellular Locations with Feature Selection and Analysis, Protein & Peptide Letters 2010; 17 (4) . https://dx.doi.org/10.2174/092986610790963654
DOI https://dx.doi.org/10.2174/092986610790963654 |
Print ISSN 0929-8665 |
Publisher Name Bentham Science Publisher |
Online ISSN 1875-5305 |
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers
Related Articles
-
COVID-19 Pandemic and the Impact on the Cardiovascular Disease Patient Care
Current Cardiology Reviews The Patents on Glucocorticosteroids and Selected New Therapies for the Management of Asthma in Children: Update
Recent Patents on Inflammation & Allergy Drug Discovery Nutraceuticals and Bio-inspired Materials from Microalgae and their Future Perspectives
Current Topics in Medicinal Chemistry Cardioembolic Stroke: Clinical Features, Specific Cardiac Disorders and Prognosis
Current Cardiology Reviews Covid-19: An Update on Clinical Features, Diagnosis, and Treatment Strategies
Coronaviruses Dual Cross-Talk between Nitric Oxide and D-Serine in Astrocytes and Neurons in the Brain
Central Nervous System Agents in Medicinal Chemistry Perioperative Considerations in Rheumatoid Arthritis Patients
Current Rheumatology Reviews Chemotherapy with si-RNA and Anti-Cancer Drugs
Current Drug Delivery Role of Polymorphisms in Factor V (FV Leiden), Prothrombin, Plasminogen Activator Inhibitor Type-1 (PAI-1), Methylenetetrahydrofolate Reductase (MTHFR) and Cystathionine β-Synthase (CBS) Genes as Risk Factors for Thrombophilias
Mini-Reviews in Medicinal Chemistry Neuronal-glial Interactions Define the Role of Nitric Oxide in Neural Functional Processes
Current Neuropharmacology Over-Expression, Solubilization, and Purification of G Protein-Coupled Receptors for Structural Biology
Combinatorial Chemistry & High Throughput Screening Prayer at Midlife is Associated with Reduced Risk of Cognitive Decline in Arabic Women
Current Alzheimer Research Antidiabetic Potential of Naturally Occurring Sesquiterpenes: A Review
Current Topics in Medicinal Chemistry Anaemia in Diabetes: An Emerging Complication of Microvascular Disease
Current Diabetes Reviews Wnt/β-catenin Antagonists: Exploring New Avenues to Trigger Old Drugs in Alleviating Glioblastoma Multiforme
Current Molecular Pharmacology Regulation of Intestinal Barrier Function by Dietary Polyphenols
Current Nutrition & Food Science Hybrid Docking-QSAR Studies of 1, 4-dihydropyridine-3, 5-Dicarboxamides as Potential Antitubercular Agents
Current Computer-Aided Drug Design Biomaterial and Mesenchymal Stem Cell for Articular Cartilage Reconstruction
Current Stem Cell Research & Therapy <i>Nigella sativa</i>, as Preventive Strategy in COVID-19
Current Traditional Medicine Novel Lipid and Polymeric Materials as Delivery Systems for Nucleic Acid Based Drugs
Current Drug Metabolism