Abstract
Background: Anemia is a major public health problem with a rising prevalence worldwide, including in Bangladesh.
Objectives: To identify the risk factors of anemia among women in Bangladesh and to predict anemia using Machine Learning (ML)-based techniques.
Methods: The anemia dataset, comprising 3,020 respondents, was extracted from the Bangladesh Demographic and Health Survey (BDHS). Two feature selection techniques, Logistic Regression (LR) and Random Forest (RF), were used to determine the risk factors of anemia. Additionally, eight ML-based techniques, namely LR, Linear Discriminant Analysis (LDA), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Quadratic Discriminant Analysis (QDA), Neural Network (NN), Classification And Regression Tree (CART), and RF, were used to predict anemia among women in Bangladesh. Classification accuracy and Area Under the Curve (AUC) were used to evaluate the performance of these classifiers.
Results: The LR- and RF-based feature selection results indicate that, of the 15 candidate factors, 13 (LR) and 14 (RF) appear to be significant risk factors for anemia among women. All predictive models achieve their highest classification accuracy (74.10-81.29%) and AUC (0.744-0.819) with the RF-selected features. Among them, the combination of RF-based feature selection with the RF-based classifier gives the highest classification accuracy (81.29%) and AUC (0.819).
Conclusion: Of the eight predictive models, the RF-RF combination shows the best performance for the prediction of anemia. This study suggests that policymakers could use the RF-RF combination to support decisions on anemia control among Bangladeshi women, saving time and reducing cost.
Keywords: Non-pregnant women of childbearing age, risk factors, identification, model, LR, RF, prediction, anemia, machine learning, Bangladesh.
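The two evaluation metrics named in the Methods, classification accuracy and AUC, can be illustrated with a minimal sketch. It is not the study's code: the function names `accuracy` and `auc`, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration, and AUC is computed here by the pairwise-ranking definition (the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative one).

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predicted labels that match the true labels."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (1 = anemic, 0 = not anemic); not taken from BDHS.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.2, 0.6, 0.4, 0.5, 0.1, 0.8, 0.3]

# Assumed decision threshold of 0.5 to turn scores into class labels.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

toy_accuracy = accuracy(y_true, y_pred)
toy_auc = auc(y_true, scores)
print(f"accuracy = {toy_accuracy:.4f}, AUC = {toy_auc:.4f}")
```

Accuracy depends on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is one reason to report both when comparing classifiers.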