Generic placeholder image

Recent Advances in Computer Science and Communications

Editor-in-Chief

ISSN (Print): 2666-2558
ISSN (Online): 2666-2566

Research Article

Prediction of Breast Cancer Using Machine Learning

Author(s): Somil Jain* and Puneet Kumar

Volume 13, Issue 5, 2020

Page: [901 - 908] Pages: 8

DOI: 10.2174/2213275912666190617160834

Price: $65

conference banner
Abstract

Background: Breast cancer is one of the diseases which cause number of deaths ever year across the globe, early detection and diagnosis of such type of disease is a challenging task in order to reduce the number of deaths. Now a days various techniques of machine learning and data mining are used for medical diagnosis which has proven there metal by which prediction can be done for the chronic diseases like cancer which can save the life’s of the patients suffering from such type of disease. The major concern of this study is to find the prediction accuracy of the classification algorithms like Support Vector Machine, J48, Naïve Bayes and Random Forest and to suggest the best algorithm.

Objective: The objective of this study is to assess the prediction accuracy of the classification algorithms in terms of efficiency and effectiveness.

Methods: This paper provides a detailed analysis of the classification algorithms like Support Vector Machine, J48, Naïve Bayes and Random Forest in terms of their prediction accuracy by applying 10 fold cross validation technique on the Wisconsin Diagnostic Breast Cancer dataset using WEKA open source tool.

Results: The result of this study states that Support Vector Machine has achieved the highest prediction accuracy of 97.89 % with low error rate of 0.14%.

Conclusion: This paper provides a clear view over the performance of the classification algorithms in terms of their predicting ability which provides a helping hand to the medical practitioners to diagnose the chronic disease like breast cancer effectively.

Keywords: Health Care, ICT, breast cancer, machine learning, classification, data mining.

Graphical Abstract

[1]
A. Sarwar, J. Manhas, and V. Sharma, ICT in Healthcare.In The Stances of Government Policies., Processes and Technologies, 2018, pp. 31-40.
[2]
P. Ramachandran, P.N. Girija, and T. Bhuvaneswari, "Early detection and prevention of cancer using data mining techniques", Int. J. Comput. Appl., vol. 97, no. 13, pp. 48-53, 2014.
[3]
M.Á. Gandarillas, and N. Goswami, Merging current health care trends: Innovative perspective in aging care Clin. Interv. Aging,, vol. 13,. pp. 2083-2095. 2018
[http://dx.doi.org/10.2147/CIA.S177286] [PMID: 30425463]
[4]
T. Intarajak, and S.H. Kang, Breast cancer decision support system for rural people , Int. J. Comput. Internet Manag. . vol 17, no. SP1, pp. 47 , pp. 1-47. 2009
[5]
H. Karim, and K. Zand, "A comparative survey on data mining techniques for breast cancer diagnosis and prediction", Indian J. Fundamental Appl. Life Sci., vol. 5, no. S1, pp. 4330-4339, 2015.
[6]
K. Williams, P.A. Idowu, J.A. Balogun, and A.I. Oluwaranti, Breast cancer risk prediction using data mining classification techniques Trans. Netw. Commun., , vol. 3, no. 2,, pp. 1-11. 2015
[http://dx.doi.org/10.14738/tnc.32.662]
[7]
A. Ahmadi, and P. Afshar, Intelligent breast cancer recognition using particle swarm optimization and support vector machines J. Exp. Theor. Artif. Intell., , vol. 28, no. 6, , pp. 1021-1034. 2016
[http://dx.doi.org/10.1080/0952813X.2015.1055828]
[8]
V. Chaurasia, S. Pal, and B.B. Tiwari, Prediction of benign and malignant breast cancer using data mining techniques J. Algorithm. Comput. Technol., , vol. 12, no. 2, , pp. 119-126. 2018
[http://dx.doi.org/10.1177/1748301818756225]
[9]
T. Tran, and U. Le, "Predicting breast cancer risk: A data mining approach", In Proceedings of the International Conference on the Development of Biomedical Engineering, pp. 223-228 2017
[10]
G.R. Kumar, G.A. Ramachandra, and K. Nagamani, "An efficient prediction of breast cancer data using data mining techniques", Int. J. Innov. Eng. Technol., vol. 2, no. 4, pp. 139-144, 2013.
[11]
H. Asria, H. Mousannifb, H. Al Moatassimec, and T. Noeld, "Using machine learning algorithms for breast cancer risk prediction and diagnosis", In Proceedings of the 6th International Symposium on Frontiers in Ambient and Mobile Systems Procedia Computer Science, vol. 83, pp. 1064-1069 2016
[http://dx.doi.org/10.1016/j.procs.2016.04.224]
[12]
G. Ogbuabor, and F.N. Ugwoke, Clustering algorithm for a healthcare dataset using silhouette score value Int. J. Comput. Sci. Informat. Technol., vol. 10, no. 2,, pp. 27-37. 2018
[http://dx.doi.org/10.5121/ijcsit.2018.10203]
[13]
N. Jothi, A.A. Rashid, and W. Husain, "Data mining in healthcare. A review", In Proceedings of the 3rd Information System International Conference, vol. 72, pp. 306-313 2015
[http://dx.doi.org/10.1016/j.procs.2015.12.145]
[14]
J.S. Saleema, P.D. Shenoy, K.R. Venugopal, and L.M. Patnaik, "Cancer prognosis prediction model using data mining techniques", Data Mining Knowl. Eng., vol. 6, no. 1, pp. 21-29, 2014.
[15]
M. Kantardzic, Data mining: Concepts, models, methods, and algorithms., John Wiley & Sons, 2003.
[16]
O. Maimon, and L. Rokach, Data mining and knowledge discovery handbook., vol. 2. Springer: New York, 2005.
[17]
X. Wu, V. Kumar, J.R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, G.J. McLachlan, A. Ng, B. Liu, S.Y. Philip, and Z.H. Zhou, Top 10 algorithms in data mining Knowl. Inf. Syst., , vol. 14, no. 1,, pp. 1-37. 2008
[http://dx.doi.org/10.1007/s10115-007-0114-2]
[18]
UCI Machine Learning Repository, Flags Data Set. Available from:.https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29 [Accessed: April 18, 2019].
[19]
D.M. Liou, and W.P. Chang, Applying data mining for the analysis of breast cancer data.In Data Mining in Clinical Medicine., Humana Press: New York, NY, pp. 175-189. 2015
[http://dx.doi.org/10.1007/978-1-4939-1985-7_12]
[20]
L.G. Ahmad, A.T. Eshlaghy, A. Poorebrahimi, M. Ebrahimi, and R. Amir, "Using three machine learning techniques for predicting breast cancer recurrence", J. Health Med. Inform., vol. 4, no. 2, pp. 1-3, 2013.
[21]
V. Chaurasia, and S. Pal, "Data mining techniques: To predict and resolve breast cancer survivability", Int. J. Comput. Sci. Mobile Comput., vol. 3, pp. 10-22, 2014.
[22]
V. Krishnaiah, G. Narsimha, and N. Subhash Chandra, "Diagnosis of lung cancer prediction system using data mining classification techniques", Int. J. Comput. Sci. Inf. Technol., vol. 4, no. 1, pp. 39-45, 2013.
[23]
A. Sahar, M. Alaa, and M. Elsayad, "Predicting the severity of breast masses with data mining methods", Int. J. Comput. Sci., vol. 10, no. 2, 2013.
[24]
H. Wang, and S.W. Yoon, "Breast cancer prediction using data mining method", In Proceedings of the Industrial and Systems Engineering Research Conference, pp. 818-828 2015
[25]
I.H. Witten, and E. Frank, "Data mining: Practical machine learning tools and techniques with Java implementations", ACM Sigmod Record , vol. 31, no. 1, pp. 76-77, 2002.
[26]
S.K. Yadav, and S. Pal, "Data mining: A prediction for performance improvement of engineering students using classification", World Comput. Sci. Inf. Technol. J., vol. 2, no. 2, pp. 51-56, 2012.
[27]
S.K. Yadav, B. Bharadwaj, and S. Pal, "Data mining applications: A comparative study for predicting student’s performance", Int. J. Innov. Technol. Creative Eng., vol. 1, no. 12, pp. 13-19, 2012.
[28]
T.M. Mitchell, Machine Learning., McGraw-Hill Science/Engineering/Math, 1997.
[29]
J.R. Quinlan, "Induction of decision trees", Mach. Learn., vol. 1, no. 1, pp. 81-106, 1986.
[http://dx.doi.org/10.1007/BF00116251]
[30]
E. Goel, E. Abhilasha, E. Goel, and E. Abhilasha, "Random forest: A review", Int. J. Adv. Res. Comput. Sci. Softw. Eng., vol. 7, no. 1, pp. 251-257, 2017.
[http://dx.doi.org/10.23956/ijarcsse/V7I1/01113]
[31]
K. Fawagre, M.M. Gaber, and E. Elyan, "Random forests: From early developments to recent advancements", Syst. Sci. Control Eng. J., vol. 2, no. 1, pp. 602-609, 2014.
[http://dx.doi.org/10.1080/21642583.2014.956265]
[32]
L. Breiman, "Random forests", Mach. Learn., vol. 45, no. 1, pp. 5-32, 2001.
[http://dx.doi.org/10.1023/A:1010933404324]
[33]
I.S. Gouda, M.B. Abdelhalim, and M.A. Zeid, "Breast cancer diagnosis on three different datasets using multi-classifiers", Int. J. Comput. Inf. Technol., vol. 1, no. 1, pp. 36-43, 2012.
[34]
S. Aruna, D.S. Rajagopalan, and L.V. Nandakishore, "Knowledge based analysis of various statistical tools in detecting breast cancer", Comput. Sci. Inf. Technol., vol. 2, pp. 37-45, 2011.
[35]
D. Lavanya, and K.U. Rani, "Analysis of feature selection with classification: Breast cancer datasets", Indian J. Comput. Sci. Eng., vol. 2, no. 5, pp. 756-763, 2011.

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy