Abstract
Background: Protein sequence and structure analyses have been essential components of bioinformatics and structural biology. They provide a deeper insight into the physicochemical properties, structure, and subsequent functions of a protein. Advanced computational approaches and bioinformatics utilities help solve several issues related to protein analysis. Still, beginners and non-professional may struggle when encountering a wide variety of computational tools and the sheer number of input parameter variables required by each tool.
Methods: We introduce a free-to-access graphical user interface (GUI) named PyPAn 'Python-based Protein Analysis' for varieties of protein sequence/structure analyses. PyPAn serves as a universal platform to analyze protein sequences, structure, and their properties. PyPAn facilitates onboard analysis of each task in just a single click. It can be used to calculate the physicochemical properties, including instability index and molar extinction coefficient, for a protein. PyPAn is one of the few computational tools that allow users to generate a Ramachandran plot and calculate solvent accessibility and the radius of gyration (Rg) of proteins at once. In addition, it can refine the protein model along with computation and minimization of its energy.
Results: PyPAn can generate a recommendation for an appropriate structure modelling method to employ for a query protein sequence. PyPAn is one of the few, if not the only, Python-based computational GUI tools with an array of options for the user to employ as they see fit.
Conclusion: PyPAn aims to unify many successful academically significant proteomic applications and is freely available for academic and industrial research uses at https://hassanlab.org/pypan.
Keywords: PyPAn, python-based protein analysis, protein analysis GUI, modelling method recommendation, protein sequence analysis, protein structure analysis, multiple protein sequence alignment, protein model refinement.
Graphical Abstract
[http://dx.doi.org/10.1093/nar/gkg520] [PMID: 12824332]
[http://dx.doi.org/10.1016/S0076-6879(03)74020-8] [PMID: 14696385]
[http://dx.doi.org/10.1093/nar/gkv236] [PMID: 25820431]
[http://dx.doi.org/10.1093/bioinformatics/18.9.1250] [PMID: 12217917]
[http://dx.doi.org/10.1002/prot.10529] [PMID: 14579342]
[http://dx.doi.org/10.1007/978-1-4939-0366-5_2] [PMID: 24573471]
[http://dx.doi.org/10.1038/nprot.2015.053] [PMID: 25950237]
[http://dx.doi.org/10.1002/prot.21945] [PMID: 18247410]
[http://dx.doi.org/10.1016/j.future.2019.04.011] [PMID: 31427836]
[http://dx.doi.org/10.1002/prot.24930] [PMID: 26370505]
[http://dx.doi.org/10.1186/1471-2105-15-S16-S7] [PMID: 25521245]
[http://dx.doi.org/10.1074/jbc.271.25.14657] [PMID: 8663297]
[http://dx.doi.org/10.1093/bioinformatics/btp163] [PMID: 19304878]
[PMID: 10027275]
[http://dx.doi.org/10.2174/0929866526666190228144219] [PMID: 30816075]
[http://dx.doi.org/10.1002/pro.3290] [PMID: 28884485]
[http://dx.doi.org/10.1186/1748-7188-5-21] [PMID: 20470396]
[http://dx.doi.org/10.1093/nar/gkn201]
[http://dx.doi.org/10.2174/1389203003381469] [PMID: 12369918]
[http://dx.doi.org/10.1016/j.ijbiomac.2015.12.072] [PMID: 26730484]
[http://dx.doi.org/10.1111/j.1399-3011.1996.tb00817.x] [PMID: 8907507]
[http://dx.doi.org/10.1007/s00894-009-0454-9] [PMID: 19234730]
[PMID: 18856071]
[http://dx.doi.org/10.1186/1758-2946-3-33] [PMID: 21982300]
[http://dx.doi.org/10.1385/1-59259-890-0:571]