Abstract
Nuclear receptors (NRs) are a superfamily of ligand-dependent transcription factors that are closely related to cell development, differentiation, reproduction, homeostasis, and metabolism. According to the alignments of the conserved domains, NRs are classified and assigned the following seven subfamilies or eight subfamilies: (1) NR1: thyroid hormone like (thyroid hormone, retinoic acid, RAR-related orphan receptor, peroxisome proliferator activated, vitamin D3- like), (2) NR2: HNF4-like (hepatocyte nuclear factor 4, retinoic acid X, tailless-like, COUP-TFlike, USP), (3) NR3: estrogen-like (estrogen, estrogen-related, glucocorticoid-like), (4) NR4: nerve growth factor IB-like (NGFI-B-like), (5) NR5: fushi tarazu-F1 like (fushi tarazu-F1 like), (6) NR6: germ cell nuclear factor like (germ cell nuclear factor), and (7) NR0: knirps like (knirps, knirpsrelated, embryonic gonad protein, ODR7, trithorax) and DAX like (DAX, SHP), or dividing NR0 into (7) NR7: knirps like and (8) NR8: DAX like. Different NRs families have different structural features and functions. Since the function of a NR is closely correlated with which subfamily it belongs to, it is highly desirable to identify NRs and their subfamilies rapidly and effectively. The knowledge acquired is essential for a proper understanding of normal and abnormal cellular mechanisms. With the advent of the post-genomics era, huge amounts of sequence-known proteins have increased explosively. Conventional methods for accurately classifying the family of NRs are experimental means with high cost and low efficiency. Therefore, it has created a greater need for bioinformatics tools to effectively recognize NRs and their subfamilies for the purpose of understanding their biological function. In this review, we summarized the application of machine learning methods in the prediction of NRs from different aspects. We hope that this review will provide a reference for further research on the classification of NRs and their families.
Keywords: Nuclear receptors (NRs), NRs families, prediction, classification, machine learning methods, feature selection.
Graphical Abstract
[http://dx.doi.org/10.18632/oncotarget.15178] [PMID: 28187442]
[http://dx.doi.org/10.1186/s12859-015-0828-1] [PMID: 26630876]
[http://dx.doi.org/10.1016/j.ab.2009.01.018] [PMID: 19454254]
[http://dx.doi.org/10.1016/S1043-2760(01)00502-1] [PMID: 11701345]
[http://dx.doi.org/10.1016/0092-8674(95)90199-X]] [PMID: 8521507]
[PMID: 8681033]
[http://dx.doi.org/10.1172/JCI92949] [PMID: 28368290]
[http://dx.doi.org/10.3389/fgene.2018.00657] [PMID: 30619477]
[http://dx.doi.org/10.1093/nar/gky1051] [PMID: 30380072]
[http://dx.doi.org/10.1074/jbc.M401932200] [PMID: 15039428]
[http://dx.doi.org/10.1093/nar/29.1.346] [PMID: 11125133]
[http://dx.doi.org/10.1242/jcs.00247] [PMID: 12538758]
[http://dx.doi.org/10.1016/S0092-8674(00)80726-6] [PMID: 10219237]
[http://dx.doi.org/10.1677/jme.0.0190207]] [PMID: 9460643]
[http://dx.doi.org/10.1371/journal.pone.0023505] [PMID: 21858146]
[http://dx.doi.org/10.1371/journal.pone.0030869] [PMID: 22363503]
[http://dx.doi.org/10.1038/srep06810] [PMID: 25351274]
[http://dx.doi.org/10.1016/j.ygeno.2019.02.006] [PMID: 30779939]
[http://dx.doi.org/10.1093/nar/gkr960] [PMID: 22064856]
[http://dx.doi.org/10.1093/bioinformatics/btg449] [PMID: 14990452]
[http://dx.doi.org/10.1093/nar/gkg505] [PMID: 12824335]
[http://dx.doi.org/10.1021/ci010385k] [PMID: 12132896]
[PMID: 25348405]
[http://dx.doi.org/10.1002/0471250953.bi0129s53]
[http://dx.doi.org/10.1093/nar/gkw1099] [PMID: 27899622]
[http://dx.doi.org/10.1093/bioinformatics/btq003] [PMID: 20053844]
[http://dx.doi.org/10.1093/bioinformatics/18.1.77] [PMID: 11836214]
[http://dx.doi.org/10.1093/bioinformatics/17.3.282] [PMID: 11294794]
[http://dx.doi.org/10.1093/bioinformatics/btl158 ] [PMID: 16731699]
[http://dx.doi.org/10.1093/bib/bby090]
[http://dx.doi.org/10.1016/0895-7177(92)90150-J]
[http://dx.doi.org/10.1093/bib/bby053] [PMID: 29947743]
[http://dx.doi.org/10.3390/molecules22101732] [PMID: 29039790]
[PMID: 8071322]
[http://dx.doi.org/10.2220/biomedres.37.251] [PMID: 27545001]
[http://dx.doi.org/10.1186/s12859-016-1405-y] [PMID: 27919220]
[http://dx.doi.org/10.1016/j.ymeth.2015.09.011] [PMID: 26370280]
[http://dx.doi.org/10.1016/j.neucom.2016.03.025]
[http://dx.doi.org/10.1016/j.jtbi.2018.11.010] [PMID: 30452961]
[http://dx.doi.org/10.1016/j.csbj.2018.10.007] [PMID: 30425802]
[http://dx.doi.org/10.3389/fimmu.2018.01695] [PMID: 30100904]
[http://dx.doi.org/10.3389/fimmu.2018.01783] [PMID: 30108593]
[http://dx.doi.org/10.1049/iet-com.2011.0170]
[http://dx.doi.org/10.1155/2014/286419] [PMID: 24991545]
[http://dx.doi.org/10.18632/oncotarget.15963] [PMID: 28423655]
[http://dx.doi.org/10.1007/s10441-013-9181-9] [PMID: 23475502]
[http://dx.doi.org/10.1039/C4MB00645C] [PMID: 25437899]
[http://dx.doi.org/10.1016/j.jprot.2012.09.006] [PMID: 23000219]
[http://dx.doi.org/10.3390/ijms150712940] [PMID: 25054318]
[http://dx.doi.org/10.1007/s00726-007-0018-1] [PMID: 18175047]
[http://dx.doi.org/10.1109/TCBB.2011.117]
[http://dx.doi.org/10.1016/j.ab.2009.04.009] [PMID: 19364489]
[http://dx.doi.org/10.2174/157340613804488341] [PMID: 22931491]
[http://dx.doi.org/10.1007/s12250-016-3740-6] [PMID: 27151186]
[http://dx.doi.org/10.2174/1574893613666181113131415]
[http://dx.doi.org/10.1093/bioinformatics/btw564] [PMID: 27565583]
[http://dx.doi.org/10.1002/prot.1035] [PMID: 11288174]
[http://dx.doi.org/10.1002/jcc.20354] [PMID: 16429410]
[http://dx.doi.org/10.1093/bioinformatics/15.12.994] [PMID: 10745989]
[http://dx.doi.org/10.1073/pnas.0607879104] [PMID: 17360525]
[http://dx.doi.org/10.1016/S1093-3263(97)00106-X]] [PMID: 9640559]
[http://dx.doi.org/10.1093/nar/18.8.2163]] [PMID: 2336393]
[http://dx.doi.org/10.1016/j.jtbi.2008.12.027] [PMID: 19183559]
[http://dx.doi.org/10.1016/j.jtbi.2011.09.028] [PMID: 22001320]
[http://dx.doi.org/10.1142/S0219720005001004] [PMID: 15852500]
[http://dx.doi.org/10.2174/1574893611666151215213508]
[http://dx.doi.org/10.1007/s00726-011-0835-0] [PMID: 21267749]
[http://dx.doi.org/10.1016/j.neucom.2014.12.123]
[http://dx.doi.org/10.1186/1471-2105-10-S1-S21] [PMID: 19208121]
[http://dx.doi.org/10.1093/bioinformatics/btt172] [PMID: 23620357]
[http://dx.doi.org/10.1093/bioinformatics/16.10.906] [PMID: 11120680]
[http://dx.doi.org/10.1093/bioinformatics/btt029] [PMID: 23335013]
[http://dx.doi.org/10.1002/prot.21677] [PMID: 17932917]
[http://dx.doi.org/10.1186/1471-2105-14-S5-S12] [PMID: 23735199]
[http://dx.doi.org/10.1186/1471-2105-10-445] [PMID: 20030857]
[http://dx.doi.org/10.1016/j.biosystems.2007.01.001] [PMID: 17291684]
[http://dx.doi.org/10.2174/1574893613666180726163429]
[http://dx.doi.org/10.2174/157016461302160514000940]
[http://dx.doi.org/10.1093/bioinformatics/btz015] [PMID: 30624619]
[http://dx.doi.org/10.1371/journal.pone.0145541] [PMID: 26713618]
[http://dx.doi.org/10.1142/S1793524517500504]
[http://dx.doi.org/10.1186/1471-2105-15-120] [PMID: 24776231]
[http://dx.doi.org/10.3389/fmicb.2018.00476] [PMID: 29616000]
[http://dx.doi.org/10.3390/ijms20081964] [PMID: 31013619]
[http://dx.doi.org/10.1016/j.omtn.2019.04.019]
[http://dx.doi.org/10.1093/bioinformatics/btz408] [PMID: 31099381]
[http://dx.doi.org/10.1109/TIT.1967.1053964]
[http://dx.doi.org/10.21037/atm.2016.03.37] [PMID: 27386492]
[http://dx.doi.org/10.1016/j.jtbi.2019.03.011] [PMID: 30880183]
[http://dx.doi.org/10.1109/TCBB.2019.2903800]
[http://dx.doi.org/10.1016/j.isatra.2019.02.034] [PMID: 30850204]
[http://dx.doi.org/10.1016/j.jtbi.2014.10.014] [PMID: 25452135]
[http://dx.doi.org/10.1016/j.ab.2012.02.007] [PMID: 22342883]
[http://dx.doi.org/10.1123/mc.2016-0007] [PMID: 27736312]
[http://dx.doi.org/10.1155/2018/1458962] [PMID: 30532768]
[http://dx.doi.org/10.1109/72.80210] [PMID: 18282828]
[http://dx.doi.org/10.1016/j.ab.2007.07.006] [PMID: 17698024]
[http://dx.doi.org/10.3109/10409239509083488]] [PMID: 7587280]
[http://dx.doi.org/10.1155/2016/5413903] [PMID: 27597968]
[http://dx.doi.org/10.1039/C5MB00883B] [PMID: 26883492]
[http://dx.doi.org/10.1155/2016/1654623] [PMID: 27437396]
[http://dx.doi.org/10.1155/2013/567529] [PMID: 24062796]
[http://dx.doi.org/10.1155/2013/530696] [PMID: 23762187]
[PMID: 30378494]
[http://dx.doi.org/10.1016/j.ab.2014.04.032] [PMID: 24802134]
[http://dx.doi.org/10.1021/acs.jproteome.8b00148] [PMID: 29893128]
[http://dx.doi.org/10.1093/bib/bby124] [PMID: 30649170]
[http://dx.doi.org/10.1186/1471-2105-10-22] [PMID: 19152693]
[http://dx.doi.org/10.1074/jbc.M511061200] [PMID: 16339140]
[http://dx.doi.org/10.1002/jmr.1061] [PMID: 20677174]
[http://dx.doi.org/10.1186/1471-2105-8-463] [PMID: 18042272]
[http://dx.doi.org/10.1371/journal.pone.0089246] [PMID: 24586628]
[http://dx.doi.org/10.1371/journal.pone.0098345] [PMID: 24897370]
[http://dx.doi.org/10.1093/bioinformatics/btx479 ] [PMID: 28961687]
[http://dx.doi.org/10.1016/j.ab.2013.05.024] [PMID: 23756733]
[http://dx.doi.org/10.2174/092986609788681733] [PMID: 19601913]
[http://dx.doi.org/10.1089/cmb.2018.0004] [PMID: 30113871]
[http://dx.doi.org/10.7150/ijbs.24174] [PMID: 29989085]
[PMID: 30247625]
[PMID: 27543076]
[http://dx.doi.org/10.1093/nar/gkw1052] [PMID: 27899615]
[PMID: 28171531]
[http://dx.doi.org/10.1016/j.knosys.2018.10.007]
[http://dx.doi.org/10.1093/bib/bbz048] [PMID: 31157855]
[http://dx.doi.org/10.3934/mbe.2019123] [PMID: 31137222]
[http://dx.doi.org/10.1016/j.ab.2018.09.002] [PMID: 30201554]
[http://dx.doi.org/10.1093/bioinformatics/bty002] [PMID: 29365045]
[http://dx.doi.org/10.1093/bib/bbx103] [PMID: 28968812]
[http://dx.doi.org/10.1002/prot.25697] [PMID: 30985027]
[http://dx.doi.org/10.1371/journal.pone.0106542 ] [PMID: 25222008]
[http://dx.doi.org/10.3389/fphar.2018.00276] [PMID: 29636690]
[http://dx.doi.org/10.3390/ijms19092817] [PMID: 30231550]
[PMID: 27473064]
[http://dx.doi.org/10.1002/prot.25377] [PMID: 28845538]
[http://dx.doi.org/10.1261/rna.069112.118] [PMID: 30425123]
[http://dx.doi.org/10.1371/journal.pone.0047843] [PMID: 23144709]
[PMID: 30428009]