Study on Intelligent Analysis and Processing Technology of Computer Big
Data Based on Clustering Algorithm

Xiaoming      Liu; Md      Rokunojjaman; Rakesh      Kumar ER; Ragimova      Nazila; Abdullayev      Vugar

doi:10.2174/2352096515666220823093929

Abstract

Aim: Clustering belongs to unsupervised learning, which divides the data objects into the data set into multiple clusters or classes, so that the objects in the same cluster have high similarity.

Background: The clustering of spatial data objects can be solved by optimization based on the clustering objective function.

Objective: Study on intelligent analysis and processing technology of computer big data based on clustering algorithm.

Methods: First, a new dynamic self-organizing feature mapping model is proposed, and the training algorithm of the model is given. Then, the spectral clustering technology and related concepts are introduced. The spectral clustering algorithm is studied and analyzed, and a spectral clustering algorithm that automatically determines the number of clusters is proposed. Furthermore, an algorithm for constructing a discrete Morse function to find the optimal solution is proposed, proving that the constructed function is the optimal discrete Morse function. At the same time, two optimization models based on the discrete Morse theory are constructed. Finally, the optimization model based on discrete Morse theory is applied to cluster analysis, and a density clustering algorithm based on the discrete Morse optimization model is proposed.

Results: This study is focused on designing and implementing a partitional-based clustering algorithm based on big data, that is suitable for clustering huge datasets to meet low computational requirements. The experiments are conducted in terms of time and space complexity and it is observed that the measure of clustering quality and the run time is capable of running in very less time without negotiating the quality of clustering. The results show that the experiments are carried out on the artificial data set and the UCI data set.

Conclusion: The efficiency and superiority of the new model, are verified by comparing it with the clustering results of the DBSCAN algorithm.

Keywords: Computation intelligence, cluster analysis, genetic algorithm, big data.

« Previous Next »

[1]
H. Sun, Z. Liu, G. Wang, W. Lian,  and J. Ma, "Intelligent analysis of medical big data based on deep learning", IEEE Access, vol. 7, pp. 142022-142037, 2019.
 [http://dx.doi.org/10.1109/ACCESS.2019.2942937]
[2]
T. Zhang,  and X. Wu, "Research on intelligent logistics development model based on internet of things and cloud computing in big data age", Rev. Fac. Ing., vol. 32, no. 6, pp. 341-346, 2017.
[3]
G. Verma, N. Pathak,  and N. Sharma, "A secure framework for health record management using blockchain in cloud environment", J. Phys. Conf. Ser., vol. 1998, no. 1, p. 012019, 2021.
 [http://dx.doi.org/10.1088/1742-6596/1998/1/012019]
[4]
S. Gupta, S. Vyas,  and K.P. Sharma, A survey on security for IoT via machine learning2020 International Conference on Computer Science, Engineering and Applications (ICCSEA) 13-14 Mar, 2020,, Gunupur, India, 2020, pp. 1-5.
 [http://dx.doi.org/10.1109/ICCSEA49143.2020.9132898]
[5]
G. Rastogi, S. Narayan, G. Krishan,  and R. Sushil, Deployment of cloud using open-source virtualization: Study of VM migration methods and benefits. Big Data Analytics., Springer: Singapore, 2018, pp. 553-563.
 [http://dx.doi.org/10.1007/978-981-10-6620-7_53]
[6]
L. Li, J. Wang,  and X. Li, "Efficiency analysis of machine learning intelligent investment based on K-means algorithm", IEEE Access, vol. 8, pp. 147463-147470, 2020.
 [http://dx.doi.org/10.1109/ACCESS.2020.3011366]
[7]
Z. Tian,  and S. Zhang, "Application of big data optimized clustering algorithm in cloud computing environment in traffic accident forecast", Peer-to-Peer Netw. Appl., vol. 14, no. 4, pp. 2511-2523, 2021.
 [http://dx.doi.org/10.1007/s12083-020-00994-3]
[8]
W.W. Ng, J. Hu, D.S. Yeung, S. Yin,  and F. Roli, "Diversified sensitivity-based undersampling for imbalance classification problems", IEEE Trans. Cybern., vol. 45, no. 11, pp. 2402-2412, 2015.
 [http://dx.doi.org/10.1109/TCYB.2014.2372060] [PMID:  25474818]
[9]
D. Wang, M. Zhou, S. Ali, P. Zhou, Y. Liu,  and X. Wang, "A novel complex event processing engine for intelligent data analysis in integrated information systems", Int. J. Distrib. Sens. Netw., vol. 12, no. 3, p. 6741401, 2016.
 [http://dx.doi.org/10.1155/2016/6741401]
[10]
Z. Xing,  and G. Li, "Intelligent classification method of remote sensing image based on big data in spark environment", Int. J. Wirel. Inf. Netw., vol. 26, no. 3, pp. 183-192, 2019.
 [http://dx.doi.org/10.1007/s10776-019-00440-z]
[11]
Z. Xu, D. Shi,  and Z. Tu, "Research on diagnostic information of smart medical care based on big data", J. Healthc. Eng., vol. 2021, p. 9977358, 2021.
 [http://dx.doi.org/10.1155/2021/9977358] [PMID:  34188793]
[12]
F. Shi,  and L. Zhu, "Analysis of trip generation rates in residential commuting based on mobile phone signaling data", J. Transp. Land Use, vol. 12, no. 1, pp. 201-220, 2019.
 [http://dx.doi.org/10.5198/jtlu.2019.1431]
[13]
X. Wendong, L. Yuanfeng,  and C. Deli, "“Algorithm of key data ensemble clustering and approximate analysis in cloud computing”, Int. J. Reason. based Intell", Syst., vol. 9, no. 3-4, pp. 177-184, 2017.
 [http://dx.doi.org/10.1504/IJRIS.2017.090038]
[14]
Y. Zhang, K. Liang, Y. Liu,  and Y. He, "The power big data-based energy analysis for intelligent community in smart grid", Int. J. Embed. Syst., vol. 11, no. 3, pp. 295-305, 2019.
 [http://dx.doi.org/10.1504/IJES.2019.099417]
[15]
F.H. Tseng, H.H. Cho,  and H.T. Wu, "Applying big data for intelligent agriculture based crop selection analysis", IEEE Access, vol. 7, pp. 116965-116974, 2019.
 [http://dx.doi.org/10.1109/ACCESS.2019.2935564]
[16]
H. Hu, B. Tang, X. Gong, W. Wei,  and H. Wang, "Intelligent fault diagnosis of the high-speed train with big data based on deep neural networks", IEEE Trans. Industr. Inform., vol. 13, no. 4, pp. 2106-2116, 2017.
 [http://dx.doi.org/10.1109/TII.2017.2683528]
[17]
A. Chakeri, I. Nekooimehr,  and L.O. Hall, "Dempster-shafer theory of evidence in single pass fuzzy c means", 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) 07-10 Jul 2013, Hyderabad, India, pp. 1-5, .
 [http://dx.doi.org/10.1109/FUZZ-IEEE.2013.6622476]
[18]
X. Wang, H. Du,  and J. Tan, "Online fault diagnosis for biochemical process based on FCM and SVM", Interdiscip. Sci., vol. 8, no. 4, pp. 419-424, 2016.
 [http://dx.doi.org/10.1007/s12539-016-0172-9] [PMID:  27129944]
[19]
Y. Wang, L. Chen,  and J.P. Mei, "Incremental fuzzy clustering with multiple medoids for large data", IEEE Trans. Fuzzy Syst., vol. 22, no. 6, pp. 1557-1568, 2014.
 [http://dx.doi.org/10.1109/TFUZZ.2014.2298244]
[20]
P. D’Urso, L. De Giovanni,  and R. Massari, "Robust fuzzy clustering of multivariate time trajectories", Int. J. Approx. Reason., vol. 99, pp. 12-38, 2018.
 [http://dx.doi.org/10.1016/j.ijar.2018.05.002]
[21]
S. Du,  and J. Li, ""Parallel processing of improved KNN text classification algorithm based on Hadoop"", 2019 7th international conference
on information, Communication and Networks (ICICN), 24-
26 Apr, 2019, Macao, China, pp. 167-170, 2019.
 [http://dx.doi.org/10.1109/ICICN.2019.8834973]
[22]
M. Cai,  and Y. Liang, "An improved CURE algorithm", International
conference on intelligence science, 15-17 Dec, 2021, Las, Vegas, NV, USA, pp. 102-111, 2021.
[23]
H. Zeng, G. Dhiman, A. Sharma, A. Sharma,  and A. Tselykh, "An IoT and Blockchain-based approach for the smart water management system in agriculture", Expert Syst., p. 12892, 2021.
 [http://dx.doi.org/10.1111/exsy.12892]
[24]
A. Sharma,  and P.K. Singh, "UAV&‐based framework for effective data analysis of forest fire detection using 5G networks: An effective approach towards smart cities solutions", Int. J. Commun. Syst., p. 4826, 2021.
 [http://dx.doi.org/10.1002/dac.4826]
[25]
P.K. Singh,  and A. Sharma, "An intelligent WSN UAV based IoT framework for precision agriculture application", Comput. Electr. Eng., vol. 100, p. 107912, 2022.
 [http://dx.doi.org/10.1016/j.compeleceng.2022.107912]
[26]
A. Sharma, P.K. Singh,  and Y. Kumar, "An integrated fire detection system using IoT and image processing technique for smart cities", Sustain Cities Soc., vol. 61, p. 102332, 2020.
 [http://dx.doi.org/10.1016/j.scs.2020.102332]
[27]
A.A. Agafonov, A.S. Yumaganov,  and V.V. Myasnikov, "Big data analysis in a geoinformatic problem of short-term traffic flow forecasting based on ak nearest neighbors method", Komput. Opt., vol. 42, no. 6, pp. 1101-1111, 2018.
 [http://dx.doi.org/10.18287/2412-6179-2018-42-6-1101-1111]
[28]
X. Yan, M. Zhang,  and Q. Wu, "Big data driven pre stack seismic intelligent inversion", Inf. Sci., vol. 549, pp. 34-52, 2021.
 [http://dx.doi.org/10.1016/j.ins.2020.11.012]
[29]
S.F. Suhel, V.K. Shukla, S. Vyas,  and V.P. Mishra, "Conversation to automation in banking through Chatbot using artificial machine intelligence language", In 2020 8th International Conference on Reliability,
Infocom Technologies and Optimization (Trends and Future
Directions)(ICRITO), 04-05 Jun, 2020 Noida, India, 2020, pp. 611-618 
 [http://dx.doi.org/10.1109/ICRITO48877.2020.9197825]
[30]
J. Bhola,  and S. Soni, "A study on research issues and challenges in WSAN", 2016 International Conference on Wireless Communications,
Signal Processing and Networking (WiSPNET), 23-25 Mar,
2016, Chennai, India, pp. 1667-1671, 2016.
 [http://dx.doi.org/10.1109/WiSPNET.2016.7566423]
[31]
L. Visuwasam,  and D.P. Raj, "A distributed intelligent mobile application for analyzing travel big data analytics", Peer-to-Peer Netw. Appl., vol. 13, no. 6, pp. 2036-2052, 2020.
 [http://dx.doi.org/10.1007/s12083-019-00799-z]
[32]
H. Liang, C. Yun, M.J. Kan,  and J. Gao, "Research and application of element logging intelligent identification model based on data mining", IEEE Access, vol. 7, pp. 94415-94423, 2019.
 [http://dx.doi.org/10.1109/ACCESS.2019.2928001]
[33]
Z. He, Y. He, F. Liu,  and Y. Zhao, "Big data-oriented product infant failure intelligent root cause identification using associated tree and fuzzy DEA", IEEE Access, vol. 7, pp. 34687-34698, 2019.
 [http://dx.doi.org/10.1109/ACCESS.2019.2904759]
[34]
"D. Ślęzak, R. Glick, P. Betliński and P. Synak, "A new approximate
query engine based on intelligent capture and fast transformations
of granulated data summaries", J. Intell. Inf. Syst., vol. 50, no. 2, pp. 385-414, 2018.
 [http://dx.doi.org/10.1007/s10844-017-0471-6]
[35]
Y. Xu, "Research on the improvement of accounting work quality of new agricultural business entities under the background of big data", Acta Agric. Scand. B Soil Plant Sci., vol. 72, no. 1, pp. 440-453, 2022.
 [http://dx.doi.org/10.1080/09064710.2021.2009553]
[36]
A. Baldominos, F. De Rada,  and Y. Saez, "DataCare: Big data analytics solution for intelligent healthcare management", Int. J. Interactive Multi. Arti. Intel., vol. 4, no. 7, p. 13, 2018.
 [http://dx.doi.org/10.9781/ijimai.2017.03.002]
[37]
J. Bhola, S. Soni,  and J. Kakarla, "A scalable and energy efficient MAC protocol for sensor and actor networks", Int. J. Commun. Syst., vol. 32, no. 13, p. e4057, 2019.
 [http://dx.doi.org/10.1002/dac.4057]
[38]
Y. Sun, H. Li, M. Shabaz,  and A. Sharma, "Research on building truss design based on particle swarm intelligence optimization algorithm", Int. J. Syst. Assur. Eng. Manag., vol. 13, no. 1, pp. 38-48, 2022.
 [http://dx.doi.org/10.1007/s13198-021-01192-x]
[39]
Y. Lei, F. Jia, J. Lin, S. Xing,  and S.X. Ding, "An intelligent fault diagnosis method using unsupervised feature learning towards mechanical big data", IEEE Trans. Ind. Electron., vol. 63, no. 5, pp. 3137-3147, 2016.
 [http://dx.doi.org/10.1109/TIE.2016.2519325]
[40]
A. Enayet, M.A. Razzaque, M.M. Hassan, A. Alamri,  and G. Fortino, "A mobility-aware optimal resource allocation architecture for big data task execution on mobile cloud in smart cities", IEEE Commun. Mag., vol. 56, no. 2, pp. 110-117, 2018.
 [http://dx.doi.org/10.1109/MCOM.2018.1700293]
[41]
Y.D. Wang, D.W. Xu, Y. Lu, J.Y. Shen,  and G.J. Zhang, "Compression algorithm of road traffic data in time series based on temporal correlation", IET Intell. Transp. Syst., vol. 12, no. 3, pp. 177-185, 2018.
 [http://dx.doi.org/10.1049/iet-its.2016.0244]
[42]
K.I. Moharm, E.F. Zidane, M.M. Mahdy,  and S. Tantawy, "Big data in ITS: Concept, case studies, opportunities, and challenges", IEEE Trans. Intell. Transp. Syst., vol. 20, no. 8, pp. 3189-3194, 2018.
 [http://dx.doi.org/10.1109/TITS.2018.2868852]

Rights & Permissions Print Cite

Article Metrics

1

Journal Information

For Authors

For Editors

For Reviewers

Explore Articles

Open Access

Open Access Articles

For Visitors

DOI https://dx.doi.org/10.2174/2352096515666220823093929	Print ISSN 2352-0965
Publisher Name Bentham Science Publisher	Online ISSN 2352-0973

Recent Advances in Electrical & Electronic Engineering

Study on Intelligent Analysis and Processing Technology of Computer Big Data Based on Clustering Algorithm

Abstract Play Pause

Related Journals

Related Books

Abstract