Abstract
Introduction: The goal of this study is to expand the diabetes decision-making framework through the advancement of computational intelligence. Several artificial network and machine- learning-based methods have been developed and validated, most of which are based on the Pima Indian dataset. So far, no method has reached an accuracy of 99-100%. Various tools such as Machine Learning (ML) and Data Mining are used for the correct identification of diabetes. These tools improve the diagnostic process associated with T2DM. Diabetes mellitus type 2 (DMT2) is a major problem in several developing countries, and its early diagnosis can save several people’s lives. Accordingly, we have to build a structure that diagnoses type 2 diabetes. This paper proposes a fuzzy expert system that uses the Mamdani fuzzy inference structure (MFIS) to diagnose type 2 diabetes accurately. The proposed research work has been created using a variety of machine learning algorithms such as J48 Decision-tree (DT), Multilayer perceptron (MLP), Support-vector-machine (SVM), Naive-Bayes (NB), Fusion, and Mixed fusion-based. Actual data from the UCI machine learning datasets are used to validate the advanced Fuzzy expert system (FES) and machine learning algorithms.
Objective: A review of recent advances in machine learning-based classification models for diabetes diagnosis is presented in this survey paper.
Methods: This paper compares modified fusion processes to fundamental models such as radial basis function, K-nearest neighbor, support vector machine, J48, logistic regression, classification, regression trees, etc., for diagnosing type 2 diabetes.
Results: Figs. 3 and 4 show the results for each classifier based on prediction accuracy.
Conclusion: The fuzzy expert system is the best among its rival classifiers. SVM performs very poorly with a very low true positive rate, i.e., a very high number of positive cases misclassified as (non-diabetic) negative. Based on the evaluation, it is clear that the fuzzy expert system has the highest precision value. However, J48 is the least accurate classifier. Compared to the other classifiers listed in the testing section, it has the greatest number of false positives. The results show that the fuzzy expert system has the uppermost cost for both precision and recall. Thus, it has the uppermost value for F-measure in the training and testing datasets. J48 is considered the secondbest classifier for the training dataset, whereas Naïve Bayes comes in the second rank in the testing dataset.
Keywords: Diabetes judgement, diabetes mellitus, computational–procedure, machine learning, decision-tree, J48, SVM, MLP, NB, FES, MFIS.
Graphical Abstract
[http://dx.doi.org/10.1016/S1098-3597(98)90014-X] [PMID: 10682169]
[http://dx.doi.org/10.1016/j.diabres.2013.07.008] [PMID: 23928565]
[http://dx.doi.org/10.1111/j.1753-6405.2009.00450.x] [PMID: 20078571]
[http://dx.doi.org/10.1023/A:1012474916001]
[http://dx.doi.org/10.1038/s41598-019-46631-9] [PMID: 31300715]
[http://dx.doi.org/10.2147/CLEP.S113415] [PMID: 27785102]