Abstract
Background: As the volume of text documents on Internet pages grows, handling such a large amount of information becomes increasingly complex. Text clustering is a common optimization problem in which a large collection of text documents is partitioned into a set of coherent clusters of similar documents.
Aims: This paper presents a novel local clustering technique, β-hill climbing, for the text document clustering problem. The technique is modeled so that similar documents are partitioned into the same cluster.
Methods: The β parameter is the primary innovation of the β-hill climbing technique; it was introduced to balance local and global search. Local search methods, such as the k-medoid and k-means techniques, have been applied successfully to the text document clustering problem.
Results: Experiments were conducted on eight standard benchmark text datasets with different characteristics, taken from the Laboratory of Computational Intelligence (LABIC). The results show that the proposed β-hill climbing achieved better results than the original hill climbing technique on the text clustering problem.
Conclusion: Adding the β operator to hill climbing improves the performance of text clustering.
Keywords: Text clustering, β-hill climbing, local exploitation search, optimization problem, clusters, k-means technique.
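The role of the β parameter described under Methods can be illustrated with a minimal sketch. The abstract does not give the algorithm's details, so everything below is an assumption made for illustration: the function names, the neighbourhood move (reassigning one document), the form of the β step (resetting each assignment to a random cluster with probability β), and the toy cost function (within-cluster sum of squared distances on small numeric vectors instead of real document vectors). With beta=0 the loop reduces to plain hill climbing.

```python
import random


def within_cluster_cost(points, labels, k):
    """Toy objective (assumed): sum of squared distances to cluster centroids."""
    total = 0.0
    for c in range(k):
        members = [p for p, lab in zip(points, labels) if lab == c]
        if not members:
            continue  # an empty cluster contributes nothing
        dim = len(members[0])
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        total += sum((p[d] - centroid[d]) ** 2 for p in members for d in range(dim))
    return total


def beta_hill_climbing(points, k, beta=0.05, iters=2000, seed=0):
    """Hypothetical sketch of a hill climber with a beta-controlled random reset."""
    rng = random.Random(seed)
    n = len(points)
    current = [rng.randrange(k) for _ in range(n)]  # random initial assignment
    current_cost = within_cluster_cost(points, current, k)
    for _ in range(iters):
        candidate = current[:]
        # Local move: reassign one randomly chosen document (exploitation).
        candidate[rng.randrange(n)] = rng.randrange(k)
        # Beta step: each assignment is reset with probability beta (exploration).
        for i in range(n):
            if rng.random() < beta:
                candidate[i] = rng.randrange(k)
        # Selection: keep the candidate only if it is not worse.
        candidate_cost = within_cluster_cost(points, candidate, k)
        if candidate_cost <= current_cost:
            current, current_cost = candidate, candidate_cost
    return current, current_cost


# Usage on six made-up 2-D "document vectors" forming two obvious groups.
points = [(0.0, 0.1), (0.1, 0.0), (0.0, 0.0), (5.0, 5.1), (5.1, 5.0), (5.0, 5.0)]
labels, final_cost = beta_hill_climbing(points, k=2, beta=0.05)
print(labels, final_cost)
```

Because a candidate is accepted only when its cost does not increase, the β step can propose larger jumps without ever making the retained solution worse; a larger β shifts the search toward exploration at the price of more rejected candidates.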