Abstract
For two decades, Rosetta has consistently been at the forefront of protein structure prediction. While it has become a very large package comprising programs, scripts, and tools, for different types of macromolecular modelling such as ligand docking, protein-protein docking, protein design, and loop modelling, it started as the implementation of an algorithm for ab initio protein structure prediction. The term ’Rosetta’ appeared for the first time twenty years ago in the literature to describe that algorithm and its contribution to the third edition of the community wide Critical Assessment of techniques for protein Structure Prediction (CASP3). Similar to the Rosetta stone that allowed deciphering the ancient Egyptian civilisation, David Baker and his co-workers have been contributing to deciphering ’the second half of the genetic code’. Although the focus of Baker’s team has expended to de novo protein design in the past few years, Rosetta’s ‘fame’ is associated with its fragment-assembly protein structure prediction approach. Following a presentation of the main concepts underpinning its foundation, especially sequence-structure correlation and usage of fragments, we review the main stages of its developments and highlight the milestones it has achieved in terms of protein structure prediction, particularly in CASP.
Keywords: Rosetta, protein structure prediction, fragment assembly, CASP, ligand docking, algorithm.
Graphical Abstract
[http://dx.doi.org/10.1007/978-1-61779-465-0_10]
[http://dx.doi.org/10.1038/nmeth0809-551]
[http://dx.doi.org/10.1006/jmbi.1997.0959] [PMID: 9149153]
[http://dx.doi.org/10.1038/s41586-019-1432-8] [PMID: 31341284]
[http://dx.doi.org/10.1002/pro.3588]
[http://dx.doi.org/10.1038/s41594-018-0141-6] [PMID: 30374087]
[http://dx.doi.org/10.1126/science.aaq1739]
[http://dx.doi.org/10.1038/nature23912] [PMID: 28953867]
[http://dx.doi.org/10.1038/nature19946] [PMID: 27629638]
[http://dx.doi.org/10.1002/(SICI)1097-0134(1999)37:3+<171:AID-PROT21>3.0.CO;2-Z] [PMID: 10526365]
[http://dx.doi.org/10.1002/(SICI)1097-0134(1999)37:3+<149:AID-PROT20>3.0.CO;2-H]
[http://dx.doi.org/10.1146/annurev.biophys.37.092707.153558]
[http://dx.doi.org/10.1016/S1359-0278(97)00067-9] [PMID: 9269572]
[http://dx.doi.org/10.1021/bi00483a001]
[http://dx.doi.org/10.1107/S2059798317008920] [PMID: 28777078]
[http://dx.doi.org/10.1021/acs.chemrev.6b00163]
[http://dx.doi.org/10.1142/S021797921840009X] [PMID: 30853739]
[http://dx.doi.org/10.1073/pnas.91.10.4436]] [PMID: 8183927]
[http://dx.doi.org/10.1371/journal.pone.0092197]
[http://dx.doi.org/10.1038/nmeth.3213]
[http://dx.doi.org/10.1002/prot.24065]
[http://dx.doi.org/10.2174/0929866523666161216124019 PMID: 27993124]
[http://dx.doi.org/10.1073/pnas.93.12.5814]
[http://dx.doi.org/10.1016/S0076-6879(04)83004-0 PMID: 15063647]
[http://dx.doi.org/10.1073/pnas.93.12.5814]
[http://dx.doi.org/10.1006/jmbi.1995.0424] [PMID: 7643386]
[http://dx.doi.org/10.1016/S0958-1669(96)80117-0] [PMID: 8768900]
[http://dx.doi.org/10.1002/(SICI)1097-0134(19990101)34:1<82:AID-PROT7>3.0.CO;2-A PMID: 10336385]
[http://dx.doi.org/10.1016/B978-0-12-394292-0.00006-0]
[http://dx.doi.org/10.1021/acs.jctc.7b00125] [PMID: 28430426]
[http://dx.doi.org/10.1002/prot.20729] [PMID: 16187354]
[http://dx.doi.org/10.1002/prot.20733]
[http://dx.doi.org/10.1002/prot.22540]
[http://dx.doi.org/10.1002/(SICI)1097-0134(19990501)35:2<133:AID-PROT1>3.0.CO;2-N]
[http://dx.doi.org/10.1371/journal.pone.0063906] [PMID: 23717507]
[http://dx.doi.org/10.1002/pro.5560060807]
[http://dx.doi.org/10.1016/j.str.2011.03.019]
[http://dx.doi.org/10.1093/bioinformatics/16.4.404 PMID: 10869041]
[http://dx.doi.org/10.1002/prot.24258]
[http://dx.doi.org/10.1093/nar/25.17.3389] [PMID: 9254694]
[http://dx.doi.org/10.1126/science.220.4598.671] [PMID: 17813860]
[http://dx.doi.org/10.1063/1.1699114]
[http://dx.doi.org/10.1093/nar/gkh468] [PMID: 15215442]
[http://dx.doi.org/10.1016/j.str.2013.08.005] [PMID: 24035711]
[http://dx.doi.org/10.1110/ps.062270707] [PMID: 17189483]
[http://dx.doi.org/10.1093/bioinformatics/bti125] [PMID: 15531603]
[http://dx.doi.org/10.1093/bioinformatics/btr350]
[http://dx.doi.org/10.1016/S0022-2836(05)80360-2] [PMID: 2231712]
[http://dx.doi.org/10.1110/ps.9.8.1487]
[http://dx.doi.org/10.1126/science.aah4043]
[http://dx.doi.org/10.1093/bioinformatics/btv767] [PMID: 26733453]
[http://dx.doi.org/10.1038/nature09304] [PMID: 20686574]
[http://dx.doi.org/10.1145/1822348.1822354]
[http://dx.doi.org/10.1038/nbt.2109] [PMID: 22267011]
[http://dx.doi.org/10.1107/S0907444911035943] [PMID: 22101816]
[http://dx.doi.org/10.1073/pnas.1115898108] [PMID: 22065763]
[http://dx.doi.org/10.1093/bioinformatics/btx283]
[http://dx.doi.org/10.1073/pnas.0305695101] [PMID: 15126668]
[http://dx.doi.org/10.1529/biophysj.107.109959] [PMID: 17496016]
[http://dx.doi.org/10.1186/1741-7007-5-17]
[http://dx.doi.org/10.1002/9781118617151.ch32]
[http://dx.doi.org/10.1038/nprot.2010.5]
[http://dx.doi.org/10.1002/prot.10141] [PMID: 12112688]
[http://dx.doi.org/10.1016/S0969-2126(02)00700-1]
[http://dx.doi.org/10.1002/prot.23190]
[http://dx.doi.org/10.1002/prot.21686] [PMID: 17654725]
[http://dx.doi.org/10.1002/prot.25775]
[http://dx.doi.org/10.1002/prot.10546] [PMID: 14579333]
[http://dx.doi.org/10.1002/prot.21669]
[http://dx.doi.org/10.1002/prot.25064] [PMID: 27171127]
[http://dx.doi.org/10.1002/(SICI)1097-0134(1997)1+<151:AID-PROT20>3.0.CO;2-M] [PMID: 9485507]
[http://dx.doi.org/10.1002/prot.10056] [PMID: 11835487]
[http://dx.doi.org/10.1002/prot.1170] [PMID: 11835488]
[http://dx.doi.org/10.1002/prot.20722]
[http://dx.doi.org/10.1002/prot.21771]
[http://dx.doi.org/10.1002/prot.22591] [PMID: 19774550]
[http://dx.doi.org/10.1002/prot.23181] [PMID: 21997521]
[http://dx.doi.org/10.1002/prot.24470] [PMID: 24343678]
[http://dx.doi.org/10.1002/prot.24973] [PMID: 26677002]
[http://dx.doi.org/10.1093/nar/gkv357]
[http://dx.doi.org/10.1002/prot.24341]
[http://dx.doi.org/10.1002/prot.24974]]
[http://dx.doi.org/10.1002/prot.24975] [PMID: 26677100]
[http://dx.doi.org/10.7554/eLife.02030] [PMID: 24842992]
[http://dx.doi.org/10.1110/ps.036442.108]
[http://dx.doi.org/10.1002/prot.25274] [PMID: 28241391]
[http://dx.doi.org/10.1002/prot.24987]
[http://dx.doi.org/10.1371/journal.pone.0049240] [PMID: 23173050]
[http://dx.doi.org/10.1186/1472-6807-13-2] [PMID: 23442819]
[http://dx.doi.org/10.1002/prot.24587]]
[http://dx.doi.org/10.1371/journal.pone.0068954] [PMID: 23935913]
[http://dx.doi.org/10.1186/s12859-015-0576-2]
[http://dx.doi.org/10.1002/prot.25415] [PMID: 29082672]
[http://dx.doi.org/10.1002/prot.25390] [PMID: 28940798]
[http://dx.doi.org/10.1002/prot.25824] [PMID: 31589782]
[http://dx.doi.org/10.1002/prot.25834] [PMID: 31602685]
[http://dx.doi.org/10.1002/pro.2389] [PMID: 24265211]