Abstract
In this chapter, we learn
about clustering and how document and text clustering can be performed. This
chapter explains the real-time applications of text clustering and the
differences between soft and hard clustering types. The clustering algorithms,
including KNN, hierarchical and Fuzzy clustering, were used . Fuzzy clustering
or soft clustering types can add better value performance-wise than the other two
clustering algorithms. Besides, we also presented how to conduct text
clustering in python using unsupervised machine learning techniques. To explain
this in detail, the IRIS dataset is considered famous in UCI Machine learning
Repository and well presented with python script.
Keywords: Clustering, Fuzzy, IRIS dataset, KNN, Soft clustering.