Abstract
Aims: In this paper, Forkhead box O (FOXO) protein using the ensemble learning algorithm is predicted. When FOXO is in excess in the human body, it leads to LNCap prostate cancer cells, and if deficit leading neurodegenerative diseases.
Objective: Neurodegenerative diseases, like Alzheimer's and Parkinson's, are neurological illnesses that are caused by damaged brain cells. For prediction of FOXO protein, Gradient Boosted Machine (GBM) and Random forest (RF) techniques are used.
Method: The main idea of using GBM is its non-linear nature but it is difficult for any single decision tree to fit all training. To overcome this, an RF algorithm is used. RF combines the results at the end of the process by average or majority rules, while the GBM algorithm combines the results along the way.
Results: A total of 29.16% improvement has been observed by RF over GBM. Average square error is also evaluated to check the testing and training of data for 100 trees on 100 tree sizes.
Conclusion: In this paper, a computational model for the prediction of FOXO protein using ensemble learning techniques (Random Forest and GBM) has been proposed. If the dataset has many variable features and the prediction accuracy is not as important then RF can be considered. On the other hand, GBMs are better suited for datasets that have very few or fewer input features and where high accuracy predictions are required. However, there are instances when either GBM or RF can perform equally well depending on how they are tuned.
Keywords: System biology, degenerative diseases, diabetes, FKHR, random forest, boosted tree.
Graphical Abstract
[http://dx.doi.org/10.1371/journal.pone.0200929] [PMID: 30044828]
[http://dx.doi.org/10.3390/ijms17122029] [PMID: 27918441]
[http://dx.doi.org/10.1042/bj3490629] [PMID: 10880363]
[http://dx.doi.org/10.1007/978-1-59745-524-4_4] [PMID: 19623486]
[http://dx.doi.org/10.3389/fneur.2017.00527] [PMID: 29046661]
[http://dx.doi.org/10.1002/mds.25150] [PMID: 23097348]
[http://dx.doi.org/10.1007/s00228-009-0742-4] [PMID: 19834698]
[http://dx.doi.org/10.4236/apd.2016.51001]
[http://dx.doi.org/10.2337/db08-1001] [PMID: 19289458]
[http://dx.doi.org/10.1038/onc.2008.29] [PMID: 18391968]
[http://dx.doi.org/10.1006/geno.1997.5122] [PMID: 9479491]
[http://dx.doi.org/10.1074/jbc.M302804200] [PMID: 12857750]
[http://dx.doi.org/10.1002/jbmr.2729] [PMID: 26462119]
[http://dx.doi.org/10.1038/onc.2008.27] [PMID: 18391976]
[http://dx.doi.org/10.1172/JCI60329] [PMID: 22326951]
[http://dx.doi.org/10.2174/156720211795495402] [PMID: 21443457]
[http://dx.doi.org/10.1074/mcp.M500158-MCP200] [PMID: 16030008]
[http://dx.doi.org/10.1074/jbc.M114.625715] [PMID: 25691571]
[http://dx.doi.org/10.1074/jbc.M900461200] [PMID: 19221179]
[http://dx.doi.org/10.1504/IJAISC.2017.084232]
[http://dx.doi.org/10.1016/j.cell.2017.09.045] [PMID: 29056338]
[http://dx.doi.org/10.1179/1743132813Y.0000000284] [PMID: 24512015]
[http://dx.doi.org/10.1016/j.csbj.2019.06.011] [PMID: 31303978]
[http://dx.doi.org/10.1007/s10120-013-0314-2] [PMID: 24202965]
[http://dx.doi.org/10.1210/en.2003-1199] [PMID: 14701673]
[http://dx.doi.org/10.1109/MCS.2009.932926]
[PMID: 34979024]
[http://dx.doi.org/10.1162/neco.1997.9.7.1545]
[http://dx.doi.org/10.1016/j.jneumeth.2013.08.024] [PMID: 24012917]
[http://dx.doi.org/10.1007/978-3-030-20951-3_18]