Abstract
Background: Protein aggregation into β-sheet-enriched insoluble assemblies is being found to be associated with an increasing number of debilitating human pathologies, such as Alzheimer’s disease or type 2 diabetes, but also with premature aging. Furthermore, protein aggregation represents a major bottleneck in the production and marketing of proteinbased therapeutics. Thus, the development of methods to accurately forecast the aggregation propensity of a certain protein is of much value.
Methods/Results: A myriad of in vitro and in vivo aggregation studies have shown that the aggregation propensity of a certain polypeptide sequence is highly dependent on its intrinsic properties and, in most cases, driven by specific short regions of high aggregation propensity. These observations have fostered the development of a first generation of algorithms aimed to predict protein aggregation propensities from the protein sequence. A second generation of programs able to map protein aggregation on protein structures is emerging. Herein, we review the most representative online accessible predictive tools, emphasizing their main distinctive features and the range of applications.
Conclusion: In this review, we describe representative biocomputational approaches to evaluate the aggregation properties of protein sequences and structures, while illustrating how they can become very useful tools to target protein aggregation in biomedicine and biotechnology.
Keywords: Amyloid, bioinformatics, protein aggregation, protein structure, therapeutic proteins, biocomputational approaches.
[http://dx.doi.org/10.1146/annurev.biochem.75.101304.123901] [PMID: 16756495]
[http://dx.doi.org/10.1016/j.tibs.2007.03.003] [PMID: 17412596]
[http://dx.doi.org/10.1016/j.tibtech.2006.02.007] [PMID: 16503059]
[http://dx.doi.org/10.1208/aapsj080359] [PMID: 17025268]
[http://dx.doi.org/10.1073/pnas.0915166107] [PMID: 20133726]
[http://dx.doi.org/10.1073/pnas.212527999] [PMID: 12374855]
[http://dx.doi.org/10.1073/pnas.0308249101] [PMID: 15123800]
[http://dx.doi.org/10.1093/protein/gzp037] [PMID: 19602569]
[http://dx.doi.org/10.1016/j.jmb.2010.09.052] [PMID: 20887731]
[http://dx.doi.org/10.1038/nature01891] [PMID: 12917692]
[http://dx.doi.org/10.1016/j.jmb.2004.06.043] [PMID: 15302561]
[http://dx.doi.org/10.1016/j.jmb.2005.04.016] [PMID: 15925383]
[http://dx.doi.org/10.1039/b706784b] [PMID: 18568165]
[http://dx.doi.org/10.1016/j.jmb.2008.05.013] [PMID: 18514226]
[http://dx.doi.org/10.1016/j.str.2006.12.007] [PMID: 17292832]
[http://dx.doi.org/10.1016/S0022-2836(02)00442-4] [PMID: 12079393]
[http://dx.doi.org/10.1186/1472-6807-5-18] [PMID: 16197548]
[http://dx.doi.org/10.1111/j.1742-4658.2005.05102.x] [PMID: 16420488]
[http://dx.doi.org/10.1186/1471-2105-8-65] [PMID: 17324296]
[http://dx.doi.org/10.1093/nar/gkv359] [PMID: 25883144]
[http://dx.doi.org/10.1021/bi00699a001] [PMID: 4358939]
[http://dx.doi.org/10.1110/ps.062624507] [PMID: 17456743]
[http://dx.doi.org/10.1093/bioinformatics/btp691] [PMID: 20019059]
[http://dx.doi.org/10.1371/journal.pcbi.0020177] [PMID: 17196033]
[http://dx.doi.org/10.1073/pnas.041617698] [PMID: 11226247]
[http://dx.doi.org/10.1093/protein/gzp033] [PMID: 19589877]
[http://dx.doi.org/10.1093/protein/gzm042] [PMID: 17720750]
[http://dx.doi.org/10.1038/nature03680] [PMID: 15944695]
[http://dx.doi.org/10.1016/j.sbi.2006.03.007] [PMID: 16563741]
[http://dx.doi.org/10.1073/pnas.0511295103] [PMID: 16537487]
[http://dx.doi.org/10.1073/pnas.97.19.10383] [PMID: 10984534]
[http://dx.doi.org/10.1016/j.sbi.2006.01.011] [PMID: 16434184]
[http://dx.doi.org/10.1038/nmeth.1432] [PMID: 20154676]
[http://dx.doi.org/10.1073/pnas.0505905102] [PMID: 16263932]
[http://dx.doi.org/10.1038/nbt1012] [PMID: 15361882]
[http://dx.doi.org/10.1371/journal.pcbi.1004013] [PMID: 25569335]
[http://dx.doi.org/10.1371/journal.pone.0054175] [PMID: 23326595]
[http://dx.doi.org/10.1186/1472-6807-9-44] [PMID: 19589171]
[http://dx.doi.org/10.1371/journal.pone.0079722] [PMID: 24260292]
[http://dx.doi.org/10.1073/pnas.0904191106] [PMID: 19571001]
[http://dx.doi.org/10.1016/0003-2697(91)90045-U] [PMID: 2042744]
[http://dx.doi.org/10.3233/JAD-150046] [PMID: 26402770]
[http://dx.doi.org/10.1016/j.jmb.2014.09.026] [PMID: 25451785]
[http://dx.doi.org/10.1093/bioinformatics/btv162] [PMID: 25792555]
[http://dx.doi.org/10.1093/nar/gki387]
[http://dx.doi.org/10.1093/protein/gzw019] [PMID: 27284085]
[http://dx.doi.org/10.1093/nar/gkt332] [PMID: 23658222]
[http://dx.doi.org/10.1093/molbev/msn122] [PMID: 18503047]
[http://dx.doi.org/10.1016/j.jmb.2010.08.013] [PMID: 20709078]
[http://dx.doi.org/10.1016/j.celrep.2013.09.043] [PMID: 24183671]
[http://dx.doi.org/10.1016/j.cell.2015.03.032] [PMID: 25957690]
[http://dx.doi.org/10.1371/journal.pcbi.1000476] [PMID: 19696882]
[http://dx.doi.org/10.1002/jps.22705] [PMID: 21789769]
[http://dx.doi.org/10.1038/nri1837] [PMID: 16622479]
[http://dx.doi.org/10.1038/srep22258] [PMID: 26924748]
[http://dx.doi.org/10.1074/jbc.M109.017657] [PMID: 19602727]
[http://dx.doi.org/10.1016/j.bbapap.2016.12.008] [PMID: 28007442]
[http://dx.doi.org/10.1093/hmg/ddw091] [PMID: 27005423]
[http://dx.doi.org/10.1074/jbc.M114.620484] [PMID: 25616660]