Abstract
Background: In the vast variety of viruses known, there is a particular interest in those transmitted to humans and whose ability to disseminate represents a significant public health issue.
Objective: The present study’s objective is to bioinformatically characterize the proteins of the two main divisions of viruses, RNA-viruses and DNA-viruses.
Methods: In this work, a set of in-house computational programs was used to calculate the polarity/ charge profiles and intrinsic disorder predisposition profiles of the proteins of several groups of viruses representing both types extracted from the UniProt database. The efficiency of these computational programs was statistically verified.
Results: It was found that the polarity/charge profile of the proteins is, in most cases, an efficient discriminant that allows the re-creation of the taxonomy known for both viral groups. Additionally, the entire set of “reviewed” proteins in the UniProt database was analyzed to find proteins with polarity/ charge profiles similar to those obtained for each viral group. This search revealed a substantial number of proteins with such polarity-charge profiles.
Conclusion: Polarity/charge profile represents a physicochemical metric, which is easy to calculate, and which can be used to effectively identify viral groups from their protein sequences.
Keywords: DNA viral protein groups, RNA viral protein groups, structural proteomics, bioinformatics, intrinsic disorder predisposition, polarity/charge profile.
Graphical Abstract
[http://dx.doi.org/10.1097/QCO.0000000000000079] [PMID: 24977685]
[http://dx.doi.org/10.1186/s12889-019-7707-z] [PMID: 31623594]
[http://dx.doi.org/10.1186/s41256-020-00135-6] [PMID: 32226823]
[http://dx.doi.org/10.1016/j.genrep.2020.100682] [PMID: 32300673]
[http://dx.doi.org/10.1016/j.meegid.2019.02.001] [PMID: 30735813]
[http://dx.doi.org/10.1161/CIRCRESAHA.120.317015] [PMID: 32264791]
[http://dx.doi.org/10.1002/1097-0134(20010101)42:1<38::AID-PROT50>3.0.CO;2-3] [PMID: 11093259]
[http://dx.doi.org/10.1142/S0219720005000886] [PMID: 15751111]
[http://dx.doi.org/10.1002/prot.20735] [PMID: 16187360]
[http://dx.doi.org/10.1016/j.bbapap.2010.01.011] [PMID: 20100603]
[http://dx.doi.org/10.1093/bioinformatics/bti541] [PMID: 15955779]
[http://dx.doi.org/10.1002/1097-0134(20001115)41:3<415::AID-PROT130>3.0.CO;2-7] [PMID: 11025552]
[http://dx.doi.org/10.1016/S1093-3263(00)00138-8] [PMID: 11381529]
[http://dx.doi.org/10.1529/biophysj.106.094045] [PMID: 17158572]
[http://dx.doi.org/10.1186/1471-2105-8-211] [PMID: 17578581]
[http://dx.doi.org/10.1038/cr.2009.87] [PMID: 19597536]
[http://dx.doi.org/10.1007/s00018-017-2555-4] [PMID: 28589442]
[http://dx.doi.org/10.1021/bi047993o] [PMID: 15697224]
[http://dx.doi.org/10.12688/f1000research.20867.1]
[http://dx.doi.org/10.1093/bioinformatics/btu625] [PMID: 25246432]
[http://dx.doi.org/10.1093/nar/gkh131] [PMID: 14681372]
[http://dx.doi.org/10.1093/database/bas015] [PMID: 22403286]
[PMID: 29311024]
[PMID: 27402792]