A Machine Learning-based Self-risk Assessment Technique for Cervical Cancer

Zeeshan       Ramzan; Muhammad    Awais    Hassan; H. M.    Shahzad    Asif; Amjad       Farooq

doi:10.2174/1574893615999200608130538

Abstract

Background: Cervical cancer is a highly significant cause of mortality in developing countries, and it is one of the most prominent forms of cancer worldwide. Machine learning techniques have been proven more accurate for the identification of cervical cancer as compared to the manual screening methods like Pap smear and Liquid Cytology Based (LCB) tests.

Objective: Primarily, these machine-learning techniques use the images of the cervix for cervical cancer risk analysis; in this article, demographic data and medical records of patients are used to identify major causes of cervical cancer. Furthermore, normal classification methods are used as a usual way of classification when the dataset is balanced as this dataset has abundant examples of negative cases as compared to positive cases On the other hand, traditional binary class classifiers are not sufficient to classify the examples of cervical cancer correctly.

Methods: We identified the major causes of cervical cancer by employing multiple machine learning feature selection algorithms. After this selection, we trained different machine learning methods including Decision Trees (DTs), Support Vector Machines (SVMs) and Ensemble Learners using all features as well as these important features.

Results and Conclusion: AdaBoost is able to classify instances into healthy and unhealthy classes of this unbalanced dataset with 96% accuracy. Based on this model and significant causes of cervical cancer, we aimed to develop a technique for self-risk assessment of cervical cancer, which women can use to know their chances of being infected from cervical cancer after answering some questions about their demographics and medical history.

Keywords: Cervical cancer, causes of cervical cancer, feature selection in machine learning, ensemble learning, cancer prediction using machine learning, AdaBoost.

« Previous Next »

Graphical Abstract

[1] 
Latha DS, Lakshmi P, Fathima S. Staging prediction in cervical cancer patients-a machine learning approach. Int J Innov Res Prac  2014; 2(2): 14-23.
[2] 
Sharma S. Cervical cancer stage prediction using decision tree approach of machine learning. Int J Adv Res Comput Commun Eng  2016; 5(4): 345-8.
[3] 
Luiz H, Lorena N, André C, Carvalho F, Lorena AC. Filter feature selection for one-class classification. J Intell Robot Syst  2015; 80: 227.
[http://dx.doi.org/10.1007/s10846-014-0101-2] 
[4] 
Bellinger C, Sharma S, Japkowicz N. One-class versus binary classification: Which and when?Book One-class versus binary classification: Which and when?.  IEEE 2012; pp. 102-6.
[http://dx.doi.org/10.1109/ICMLA.2012.212] 
[5] 
Sharma S, Gupta S. Decision tree approach in machine learning for prediction of cervical cancer stages using WEKA. Int J Recent Trends Eng Res  2016; 2(8): 74-83.
[6] 
Fernandes K, Cardoso JS, Fernandes J. Transfer learning with partial observability applied to cervical cancer screeningBook Transfer learning with partial observability applied to cervical cancer screening.  Springer 2017; pp. 243-50.
[http://dx.doi.org/10.1007/978-3-319-58838-4_27] 
[7] 
Song Q, Ni J, Wang G. A fast clustering-based feature subset selection algorithm for high-dimensional data. IEEE Trans Knowl Data Eng  2013; 25(1): 1-14.
[http://dx.doi.org/10.1109/TKDE.2011.181] 
[8] 
Abdi H, Williams LJ. Principal component analysis. Wiley Interdiscip Rev Comput Stat  2010; 2(4): 433-59.
[http://dx.doi.org/10.1002/wics.101] 
[9] 
Johannes M, Brase JC, Fröhlich H, et al. Integration of pathway knowledge into a reweighted recursive feature elimination approach for risk stratification of cancer patients. Bioinformatics  2010; 26(17): 2136-44.
[http://dx.doi.org/10.1093/bioinformatics/btq345 PMID: 20591905] 
[10] 
Ben-Hur A, Weston J. ‘A user’s guide to support vector machines’: ‘Data mining techniques for the life sciences.  Springer 2010; pp. 223-39.
[http://dx.doi.org/10.1007/978-1-60327-241-4_13] 
[11] 
Aldian RD, Purwanti E, Bustomi MA. Applied computing based artificial neural network for classification of cervical cancer. CISAK 2013: The 6th Conference of Indonesian Students Association in KoreaAt: KAIST, Daejeon, Korea 2013.. 
[12] 
Sokouti B, Haghipour S, Tabrizi AD. A framework for diagnosing cervical cancer disease based on feedforward MLP neural network and ThinPrep histopathological cell image features. Neural Comput Appl  2014; 24(1): 221-32.
[http://dx.doi.org/10.1007/s00521-012-1220-y] 
[13] 
Kusy M, Obrzut B, Kluska J. Application of gene expression programming and neural networks to predict adverse events of radical hysterectomy in cervical cancer patients. Med Biol Eng Comput  2013; 51(12): 1357-65.
[http://dx.doi.org/10.1007/s11517-013-1108-8 PMID: 24136688] 
[14] 
Devi MA, Ravi S, Vaishnavi J, Punitha S. Classification of cervical cancer using artificial neural networks. Procedia Comput Sci  2016; 89: 465-72.
[http://dx.doi.org/10.1016/j.procs.2016.06.105] 
[15] 
Mariarputham EJ, Stephen A. Nominated texture based cervical cancer classification. Comput Math Methods Med  2015; 2015586928
[http://dx.doi.org/10.1155/2015/586928] 
[16] 
Wieslander H, Forslid G, Bengtsson E, et al. Deep convolutional neural networks for detecting cellular changes due to malignancybook deep convolutional neural networks for detecting cellular changes due to malignancy  2017; 82-9.
[17] 
Ceylan Z, Pekel E. Comparison of multi-label classification methods for prediagnosis of cervical cancer 2017 ; 21: 22..
[http://dx.doi.org/10.18201/ijisae.2017533896] 
[18] 
Fatlawi HK. Enhanced classification model for cervical cancer dataset based on cost sensitive classifier. Int J Comput Technol  2017; 4(4): 115-20.
[19] 
Tseng C-J, Lu C-J, Chang C-C, Chen G-D, Cheewakriangkrai C. Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence. Artif Intell Med  2017; 78: 47-54.
[http://dx.doi.org/10.1016/j.artmed.2017.06.003 PMID: 28764872] 
[20] 
Wu W, Zhou H. Data-driven diagnosis of cervical cancer with support vector machine-based approaches IEEE Access 2017; 5: 25189-95. 
[http://dx.doi.org/10.1109/ACCESS.2017.2763984] 
[21] 
Abdoh SF, Rizka MA, Maghraby FA. Cervical cancer diagnosis using random forest classifier with SMOTE and feature reduction techniques 
[22] 
Fernandes K, Chicco D, Cardoso JS, Fernandes J. Supervised deep learning embeddings for the prediction of cervical cancer diagnosis. PeerJ Comput Sci  2018; 4e154
[http://dx.doi.org/10.7717/peerj-cs.154] 
[23] 
Al-Wesabi Y, Choudhury A, Won D. Classification of cervical cancer datasetBook Classification of cervical cancer dataset  2018 pp. 1455-61..
[24] 
Sawhney R, Mathur P, Shankar R. A firefly algorithm based wrapper-penalty feature selection method for cancer diagnosisBook A firefly algorithm based wrapper-penalty feature selection method for cancer diagnosis.  Springer 2018; pp. 438-49.
[http://dx.doi.org/10.1007/978-3-319-95162-1_30] 
[25] 
Alam TM, Khan MMA, Iqbal MA, Abdul W, Mushtaq M. Cervical cancer prediction through different screening methods using data mining. Int J Adv Comput Sci Appl  2019; 10(2): 388-96.
[http://dx.doi.org/10.14569/IJACSA.2019.0100251] 
[26] 
Ashraf FB, Momo NS. Comparative analysis on Prediction Models with various Data Preprocessings in the Prognosis of Cervical CancerBook Comparative analysis on Prediction Models with various Data Preprocessings in the Prognosis of Cervical Cancer 2019.; pp. 1-6..
[27] 
Jain R, Sangwan SR, Bachhety S, Garg S, Upadhyay Y. Optimized model for cervical cancer detection using binary cuckoo search. Recent Pat Comput Sci  2019; 12(4): 293-303.
[http://dx.doi.org/10.2174/2213275911666181120092223] 
[28] 
Karim E, Neehal N. An Empirical study of cervical cancer diagnosis using ensemble MethodsBook an empirical study of cervical cancer diagnosis using ensemble methods.  IEEE 2019; pp. 1-5.
[http://dx.doi.org/10.1109/ICASERT.2019.8934464] 
[29] 
Li F-Q, Wang S-L, Liu G-S. A Bayesian possibilistic c-means clustering approach for cervical cancer screening. Inf Sci  2019; 501: 495-510.
[http://dx.doi.org/10.1016/j.ins.2019.05.089] 
[30] 
Ripon SH, Bhuiyan NQ. Cervical cancer risk factors: classification and mining associations. APTIKOM J Comput Sci Inform Technol  2019; 4(1): 8-18.
[http://dx.doi.org/10.11591/APTIKOM.J.CSIT.131] 
[31] 
Yang W, Gou X, Xu T, Yi X, Jiang M. Cervical Cancer Risk Prediction Model and Analysis of Risk Factors based on Machine LearningBook Cervical Cancer Risk Prediction Model and Analysis of Risk Factors based on Machine Learning .; pp. 50-4.. 
[http://dx.doi.org/10.1145/3340074.3340078] 
[32] 
Lu J, Song E, Ghoneim A, Alrashoud M. Machine learning for  assisting cervical cancer diagnosis: An ensemble approach. Future
  Gener Comput Syst 2020; 106: 199-205.. 
[http://dx.doi.org/10.1016/j.future.2019.12.033] 
[33] 
Singh S, Tejaswini V, Murthy RP, Mutgi A. ‘Neural network based automated system for diagnosis of cervical cancer’: ‘Deep Learning and Neural Networks: Concepts, Methodologies, Tools, and Applications.  IGI Global 2020; pp. 1422-36.

Rights & Permissions Print Cite

Article Metrics

15

1

Journal Information

For Authors

For Editors

For Reviewers

Explore Articles

Open Access

Open Access Articles

For Visitors

DOI https://dx.doi.org/10.2174/1574893615999200608130538	Print ISSN 1574-8936
Publisher Name Bentham Science Publisher	Online ISSN 2212-392X

Current Bioinformatics

A Machine Learning-based Self-risk Assessment Technique for Cervical Cancer

Abstract Play Pause

Graphical Abstract

Related Journals

Related Books

Related Articles

Abstract