A Simple Protein Evolutionary Classification Method Based on the Mutual Relations Between Protein Sequences

Xiaogeng	      Wan; Xinying	      Tan

doi:10.2174/1574893615666200305090055

Abstract

Background: Protein is a kind of important organics in life. It is varied with its sequences, structures and functions. Protein evolutionary classification is one of the popular research topics in computational bioinformatics. Many studies have used protein sequence information to classify the evolutionary relationships of proteins. As the amount of protein sequence data increases, efficient computational tools are needed to make efficient protein evolutionary classifications with high accuracies in the big data paradigm.

Methods: In this study, we propose a new simple and efficient computational approach based on the normalized mutual information rates to compute the relationship between protein sequences, we then use the “distances” defined on the relationships to perform the evolutionary classifications of proteins. The new method is computational efficient, model-free and unsupervised, which does not require training data when performing classifications.

Results: Simulation studies on various examples demonstrate the efficiency of the new method. We use precision-recall curves to compare the efficiency of our new method with traditional methods, results show that the new method outperforms the traditional methods in most of the cases when performing evolutionary classifications.

Conclusion: The new method is simple and proved to be efficient in protein evolutionary classifications, which is useful in future evolutionary analysis particularly in the big data paradigm.

Keywords: Protein evolutionary classification, mutual information rate, protein sequence, precision-recall, computational, machine learning.

« Previous Next »

Graphical Abstract

Rights & Permissions Print Cite

Article Metrics

29

1

Journal Information

For Authors

For Editors

For Reviewers

Explore Articles

Open Access

Open Access Articles

For Visitors

DOI https://dx.doi.org/10.2174/1574893615666200305090055	Print ISSN 1574-8936
Publisher Name Bentham Science Publisher	Online ISSN 2212-392X

Current Bioinformatics

A Simple Protein Evolutionary Classification Method Based on the Mutual Relations Between Protein Sequences

Abstract Play Pause

Graphical Abstract

Related Journals

Related Books

Abstract