Abstract
Lots of cold-adapted organisms could produce antifreeze proteins (AFPs) to counter the freezing of cell fluids by controlling the growth of ice crystal. AFPs have been found in various species such as in vertebrates, invertebrates, plants, bacteria, and fungi. These AFPs from fish, insects and plants displayed a high diversity. Thus, the identification of the AFPs is a challenging task in computational proteomics. With the accumulation of AFPs and development of machine meaning methods, it is possible to construct a high-throughput tool to timely identify the AFPs. In this review, we briefly reviewed the application of machine learning methods in antifreeze proteins identification from difference section, including published benchmark dataset, sequence descriptor, classification algorithms and published methods. We hope that this review will produce new ideas and directions for the researches in identifying antifreeze proteins.
Keywords: Antifreeze protein, classification, machine learning, computational proteomics, cold-adapted organisms, cell fluids.
Graphical Abstract
[http://dx.doi.org/10.1073/pnas.94.8.3485] [PMID: 9108001]
[http://dx.doi.org/10.1007/s000180050289] [PMID: 10188586]
[http://dx.doi.org/10.1016/S0959-437X(98)80042-7] [PMID: 9914209]
[http://dx.doi.org/10.1016/S0959-440X(97)80154-6] [PMID: 9434903]
[http://dx.doi.org/10.1016/0022-2836(92)90666-8] [PMID: 1738160]
[http://dx.doi.org/10.1034/j.1399-3054.2001.1120111.x] [PMID: 11319018]
[http://dx.doi.org/10.1098/rstb.2002.1081] [PMID: 12171656]
[http://dx.doi.org/10.1016/0167-4838(92)90355-H] [PMID: 1599942]
[http://dx.doi.org/10.1002/jcp.1030490103]
[http://dx.doi.org/10.1096/fasebj.4.8.2185972] [PMID: 2185972]
[http://dx.doi.org/10.1038/35018610] [PMID: 10917537]
[http://dx.doi.org/10.1093/bib/bby053] [PMID: 29947743]
[http://dx.doi.org/10.1016/j.jtbi.2014.04.006] [PMID: 24732262]
[http://dx.doi.org/10.1016/j.ab.2014.04.032] [PMID: 24802134]
[http://dx.doi.org/10.1038/s41598-017-06195-y] [PMID: 28724993]
[http://dx.doi.org/10.1186/s12864-017-4338-6] [PMID: 29363423]
[http://dx.doi.org/10.1186/s12859-018-2098-1] [PMID: 29671398]
[http://dx.doi.org/10.1002/prot.25697] [PMID: 30985027]
[http://dx.doi.org/10.1016/j.jtbi.2010.10.037] [PMID: 21056045]
[http://dx.doi.org/10.1371/journal.pone.0020445] [PMID: 21655262]
[http://dx.doi.org/10.3390/ijms13022196] [PMID: 22408447]
[http://dx.doi.org/10.1007/s00232-015-9811-z] [PMID: 26058944]
[http://dx.doi.org/10.1007/s00232-016-9935-9] [PMID: 27812737]
[http://dx.doi.org/10.1109/TCBB.2016.2617337] [PMID: 28113406]
[PMID: 29106639]
[PMID: 27543076]
[http://dx.doi.org/10.1093/nar/gkw1052] [PMID: 27899615]
[http://dx.doi.org/10.1093/bioinformatics/btx223] [PMID: 28419194]
[PMID: 28171531]
[http://dx.doi.org/10.1038/s41598-017-08115-6] [PMID: 28784999]
[http://dx.doi.org/10.1093/nar/gkv1100] [PMID: 26503249]
[http://dx.doi.org/10.1093/nar/gku315]
[http://dx.doi.org/10.1093/nar/gky1051] [PMID: 30380072]
[http://dx.doi.org/10.2174/1566523218666181010101114] [PMID: 30306867]
[http://dx.doi.org/10.1002/(SICI)1097-0134(199707)28:3<405:AID-PROT10>3.0.CO;2-L] [PMID: 9223186]
[http://dx.doi.org/10.1093/bioinformatics/17.3.282] [PMID: 11294794]
[http://dx.doi.org/10.1093/bib/bby090] [PMID: 30239587]
[http://dx.doi.org/10.1093/nar/28.1.235] [PMID: 10592235]
[http://dx.doi.org/10.1093/bioinformatics/btg224] [PMID: 12912846]
[http://dx.doi.org/10.1093/bioinformatics/btm404] [PMID: 17846036]
[PMID: 30378494]
[http://dx.doi.org/10.1155/2016/5413903] [PMID: 27597968]
[http://dx.doi.org/10.1039/C5MB00883B] [PMID: 26883492]
[http://dx.doi.org/10.1155/2016/1654623] [PMID: 27437396]
[http://dx.doi.org/10.1039/C4MB00645C] [PMID: 25437899]
[http://dx.doi.org/10.1016/j.bbrc.2008.01.038] [PMID: 18206645]
[http://dx.doi.org/10.1093/bioinformatics/16.4.404] [PMID: 10869041]
[http://dx.doi.org/10.1093/nar/28.1.374] [PMID: 10592278]
[http://dx.doi.org/10.1002/prot.21018] [PMID: 16752418]
[http://dx.doi.org/10.1155/2013/530696]
[http://dx.doi.org/10.1155/2013/567529]
[http://dx.doi.org/10.1093/bioinformatics/btw564] [PMID: 27565583]
[http://dx.doi.org/10.3934/mbe.2019123] [PMID: 31137222]
[http://dx.doi.org/10.1142/S1793524513500034]
[http://dx.doi.org/10.1093/bioinformatics/btl677] [PMID: 17237066]
[http://dx.doi.org/10.1007/s00726-009-0381-1] [PMID: 19908123]
[http://dx.doi.org/10.1016/j.ins.2016.06.026]
[http://dx.doi.org/10.1093/nar/29.14.2994] [PMID: 11452024]
[http://dx.doi.org/10.1093/nar/25.17.3389] [PMID: 9254694]
[http://dx.doi.org/10.1002/prot.1035] [PMID: 11288174]
[http://dx.doi.org/10.1016/j.jtbi.2010.12.024] [PMID: 21168420]
[http://dx.doi.org/10.2174/092986609787848045] [PMID: 19356130]
[http://dx.doi.org/10.1016/0003-2670(93)80437-P]
[http://dx.doi.org/10.1109/CSB.2003.1227396]
[http://dx.doi.org/10.1016/j.neucom.2014.12.123]
[http://dx.doi.org/10.1186/s12918-016-0353-5] [PMID: 28155714]
[http://dx.doi.org/10.1093/bioinformatics/bty522] [PMID: 29947803]
[http://dx.doi.org/10.1093/bioinformatics/bty140] [PMID: 29528364]
[PMID: 30247625]
[http://dx.doi.org/10.1016/j.bbrc.2016.06.035] [PMID: 27291150]
[http://dx.doi.org/10.1186/1471-2164-9-S2-S27] [PMID: 18831793]
[http://dx.doi.org/10.1093/bioinformatics/bth261] [PMID: 15073010]
[http://dx.doi.org/10.1002/prot.21309]
[http://dx.doi.org/10.1007/s11704-014-4089-3]
[http://dx.doi.org/10.1016/j.bbrc.2009.12.019] [PMID: 19995554]
[PMID: 30124147]
[http://dx.doi.org/10.1142/S1793524517500504]
[PMID: 28035027]
[http://dx.doi.org/10.1016/j.jtbi.2017.03.031] [PMID: 28411111]
[http://dx.doi.org/10.1093/bioinformatics/btu852] [PMID: 25568279]
[http://dx.doi.org/10.1093/bioinformatics/btt603] [PMID: 24149049]
[http://dx.doi.org/10.1023/A:1010933404324]
[http://dx.doi.org/10.3389/fgene.2018.00433] [PMID: 30327665]
[http://dx.doi.org/10.1016/j.ymeth.2019.02.009] [PMID: 30772464]
[http://dx.doi.org/10.2174/157016461104150121115154]
[http://dx.doi.org/10.1093/bib/bbz048] [PMID: 31157855]
[http://dx.doi.org/10.1371/journal.pone.0106542] [PMID: 25222008]
[http://dx.doi.org/10.3389/fimmu.2018.01783] [PMID: 30108593]
[http://dx.doi.org/10.1093/bioinformatics/btz015] [PMID: 30624619]
[http://dx.doi.org/10.1016/j.ab.2013.05.024] [PMID: 23756733]
[http://dx.doi.org/10.2174/1574893611666160608102537]
[http://dx.doi.org/10.2174/1574893611666160608075753]
[http://dx.doi.org/10.2174/157016461302160514000940]
[http://dx.doi.org/10.18632/oncotarget.15963] [PMID: 28423655]
[http://dx.doi.org/10.1186/1471-2105-15-120] [PMID: 24776231]
[PMID: 29416743]
[http://dx.doi.org/10.3389/fmicb.2018.00476] [PMID: 29616000]
[http://dx.doi.org/10.1093/bioinformatics/btq043] [PMID: 20130033]
[http://dx.doi.org/10.1371/journal.pone.0007072] [PMID: 19759917]
[http://dx.doi.org/10.18632/oncotarget.20365] [PMID: 29100375]
[http://dx.doi.org/10.1093/bioinformatics/bty1047] [PMID: 30590410]
[http://dx.doi.org/10.1093/bioinformatics/btx222] [PMID: 28419290]
[http://dx.doi.org/10.1145/1961189.1961199]
[http://dx.doi.org/10.1016/j.knosys.2018.10.007]
[http://dx.doi.org/10.7150/ijbs.24174] [PMID: 29989085]
[http://dx.doi.org/10.1007/BF00993106]
[PMID: 30428009]
[http://dx.doi.org/10.1016/j.ygeno.2016.02.005] [PMID: 26921858]
[http://dx.doi.org/10.1016/j.csbj.2018.10.007] [PMID: 30425802]
[http://dx.doi.org/10.3389/fimmu.2018.01695] [PMID: 30100904]
[http://dx.doi.org/10.3389/fphar.2018.00276] [PMID: 29636690]
[http://dx.doi.org/10.1093/bioinformatics/btx479] [PMID: 28961687]
[http://dx.doi.org/10.1371/journal.pone.0145541] [PMID: 26713618]
[http://dx.doi.org/10.1093/bioinformatics/bty002] [PMID: 29365045]
[http://dx.doi.org/10.1021/acs.jproteome.8b00148] [PMID: 29893128]
[http://dx.doi.org/10.1109/TCBB.2018.2816032] [PMID: 29993815]
[http://dx.doi.org/10.1142/S0219720005001004] [PMID: 15852500]
[http://dx.doi.org/10.1016/j.neucom.2018.04.082]
[http://dx.doi.org/10.1016/j.jpdc.2017.08.009]
[http://dx.doi.org/10.2174/1574893612666170707095707]
[http://dx.doi.org/10.1093/bfgp/ely1030] [PMID: 30265280]
[http://dx.doi.org/10.3390/molecules22101732] [PMID: 29039790]
[http://dx.doi.org/10.1186/s12859-016-1405-y] [PMID: 27919220]
[http://dx.doi.org/10.1016/j.ymeth.2015.09.011] [PMID: 26370280]