Abstract
Background: The expression of secretory proteins is involved in each stage of biomass from fetal development to the immune response. As an animal model for the study of human diseases, the study of protein secretion in pigs has strong application prospects.
Objective: Although secretory proteins play an important role in cell activities, there are no machine learning-based approaches for the prediction of pig secretory proteins. This study aims to establish a prediction model for identifying the secretory protein in Sus scrofa.
Methods: Based on the pseudo composition of k-spaced amino acid pairs feature encoding method and support vector machine algorithm, a prediction model was established for the identification of the secretory protein in Sus scrofa.
Results: The model produced the AUROC of 0.885 and 0.728 on the training set and independent testing set, respectively. In addition, we discussed features used for the prediction.
Conclusion: In this study, we proposed the first classification model to identify secretory proteins in Sus scrofa. By learning the characteristic of secretory proteins, it may become feasible to design and produce secretory proteins with distinctive properties that are currently unavailable.
Graphical Abstract
[http://dx.doi.org/10.1111/petr.14260] [PMID: 35233893]
[http://dx.doi.org/10.1016/j.actbio.2017.11.054] [PMID: 29225150]
[http://dx.doi.org/10.1016/j.tim.2011.11.002] [PMID: 22153753]
[http://dx.doi.org/10.1093/nar/gkab638] [PMID: 34331449]
[http://dx.doi.org/10.1093/bioinformatics/btab036] [PMID: 33471060]
[http://dx.doi.org/10.1093/bfgp/elaa023] [PMID: 33313647]
[http://dx.doi.org/10.3390/cells10113242] [PMID: 34831463]
[http://dx.doi.org/10.1017/S0954422421000019] [PMID: 33461642]
[http://dx.doi.org/10.1016/j.cmet.2010.09.015] [PMID: 21035759]
[http://dx.doi.org/10.3748/wjg.v27.i30.5047] [PMID: 34497434]
[http://dx.doi.org/10.7150/ijbs.59149] [PMID: 33907512]
[http://dx.doi.org/10.7150/ijbs.72706] [PMID: 35982905]
[http://dx.doi.org/10.1016/j.aninu.2021.01.001] [PMID: 34258416]
[http://dx.doi.org/10.1016/j.molimm.2022.02.007] [PMID: 35184022]
[http://dx.doi.org/10.3390/ijms22094846] [PMID: 34063669]
[http://dx.doi.org/10.2174/1389203718666170829120729] [PMID: 28847289]
[http://dx.doi.org/10.1093/humupd/dmaa016] [PMID: 32378701]
[http://dx.doi.org/10.1051/rnd:2002031] [PMID: 12510876]
[http://dx.doi.org/10.1007/BF00186463] [PMID: 2075914]
[http://dx.doi.org/10.1002/imt2.42] [PMID: 36245702]
[http://dx.doi.org/10.1164/rccm.201807-1345OC] [PMID: 30543455]
[http://dx.doi.org/10.3390/microorganisms8111718] [PMID: 33147871]
[http://dx.doi.org/10.3389/fpls.2021.506681] [PMID: 33732270]
[http://dx.doi.org/10.1186/s12864-016-3097-0] [PMID: 28185571]
[http://dx.doi.org/10.1371/journal.pcbi.1010610] [PMID: 36260616]
[http://dx.doi.org/10.1109/TCBB.2020.2966633] [PMID: 31944984]
[http://dx.doi.org/10.48550/arXiv.1809.04461]
[http://dx.doi.org/10.1186/s12859-021-04446-4] [PMID: 34753427]
[http://dx.doi.org/10.1093/bib/bbab416] [PMID: 34623382]
[http://dx.doi.org/10.1093/bib/bbab346] [PMID: 34661237]
[http://dx.doi.org/10.3389/fcell.2020.591487] [PMID: 33195258]
[http://dx.doi.org/10.1093/nar/gkab829] [PMID: 34581805]
[http://dx.doi.org/10.1093/bioinformatics/btac106] [PMID: 35176130]
[http://dx.doi.org/10.1016/j.compbiomed.2021.105006] [PMID: 34749096]
[http://dx.doi.org/10.1016/j.compbiomed.2021.104243] [PMID: 33550014]
[http://dx.doi.org/10.3390/ijms22169054] [PMID: 34445760]
[http://dx.doi.org/10.1093/bib/bbab376] [PMID: 34532736]
[http://dx.doi.org/10.1016/j.jmb.2022.167604] [PMID: 35662468]
[http://dx.doi.org/10.1016/j.csbj.2022.07.043] [PMID: 36051870]
[http://dx.doi.org/10.1155/2021/6664362] [PMID: 33505515]
[http://dx.doi.org/10.1016/j.omtn.2019.05.028] [PMID: 31299595]
[http://dx.doi.org/10.1002/prot.1035] [PMID: 11288174]
[http://dx.doi.org/10.2174/1574893615666210108094431]
[http://dx.doi.org/10.1093/nar/gkz740] [PMID: 31504851]
[http://dx.doi.org/10.2174/2212392XMTA3wMTIj0]
[http://dx.doi.org/10.1016/j.csbj.2022.08.053] [PMID: 36147670]
[http://dx.doi.org/10.7150/ijbs.24174] [PMID: 29989085]
[http://dx.doi.org/10.2174/1574893616666211007102747]
[http://dx.doi.org/10.1093/bioinformatics/bty002] [PMID: 29365045]
[http://dx.doi.org/10.1093/nar/gkab016] [PMID: 33503258]
[http://dx.doi.org/10.1007/s00438-021-01789-8] [PMID: 33914130]
[http://dx.doi.org/10.1109/TPAMI.2005.159] [PMID: 16119262]
[http://dx.doi.org/10.1093/bib/bbab501] [PMID: 34864886]
[http://dx.doi.org/10.1093/bib/bbab486] [PMID: 34864888]
[http://dx.doi.org/10.1016/j.ymeth.2021.05.016] [PMID: 34033879]
[PMID: 34500458]
[http://dx.doi.org/10.1371/journal.pcbi.1010404] [PMID: 35969645]
[http://dx.doi.org/10.2174/1574893616666210827095829]
[http://dx.doi.org/10.1016/j.compbiomed.2020.104172] [PMID: 33352307]
[http://dx.doi.org/10.1093/bib/bbab480] [PMID: 34850821]
[http://dx.doi.org/10.2174/2212392XMTA3bMTYiy]
[http://dx.doi.org/10.1016/j.compbiomed.2020.103722] [PMID: 32250854]
[http://dx.doi.org/10.1093/bib/bbac395] [PMID: 36070864]
[http://dx.doi.org/10.1093/bib/bbac240] [PMID: 35817303]
[http://dx.doi.org/10.1016/j.inffus.2021.02.015]
[http://dx.doi.org/10.1093/bioinformatics/btz694] [PMID: 31588505]
[http://dx.doi.org/10.1093/bib/bbaa395] [PMID: 33415328]
[http://dx.doi.org/10.1371/journal.pcbi.1008696] [PMID: 33561121]
[http://dx.doi.org/10.1016/j.omtn.2019.04.019] [PMID: 31146255]
[http://dx.doi.org/10.1093/bib/bby124] [PMID: 30649170]
[http://dx.doi.org/10.1109/TCBB.2013.146] [PMID: 26355518]
[http://dx.doi.org/10.1016/j.artmed.2017.02.005] [PMID: 28245947]
[http://dx.doi.org/10.1093/bib/bbab364] [PMID: 34505623]
[http://dx.doi.org/10.1073/pnas.2102960118] [PMID: 33972411]
[PMID: 34802404]
[http://dx.doi.org/10.1093/bioinformatics/btab810] [PMID: 34864847]
[http://dx.doi.org/10.1093/bioinformatics/btaa667] [PMID: 32702119]
[http://dx.doi.org/10.1504/IJDMB.2013.056078] [PMID: 24417022]
[http://dx.doi.org/10.1093/bioinformatics/btac538] [PMID: 35904544]
[PMID: 34254917]
[http://dx.doi.org/10.1093/bib/bbab023] [PMID: 33693454]
[http://dx.doi.org/10.1155/2020/8926750] [PMID: 33133228]
[http://dx.doi.org/10.3389/fbioe.2020.584807] [PMID: 33195148]
[http://dx.doi.org/10.1093/bib/bbab335] [PMID: 34415016]
[http://dx.doi.org/10.2217/epi-2019-0321] [PMID: 32921165]
[http://dx.doi.org/10.1093/bib/bbab252] [PMID: 34226917]
[http://dx.doi.org/10.1016/j.ymthe.2022.05.001] [PMID: 35526094]
[http://dx.doi.org/10.1016/j.jmb.2021.166860] [PMID: 33539888]
[http://dx.doi.org/10.1093/nar/gkz843] [PMID: 31584099]
[PMID: 34161210]
[http://dx.doi.org/10.1093/nar/gkaa1100] [PMID: 33237286]