A parallel text clustering method using Spark and hashing.
Mohamed Aymen Ben HajKacemChiheb-Eddine Ben N'cirNadia EssoussiPublished in: Computing (2021)
Keyphrases
- clustering method
- clustering algorithm
- spectral clustering
- fuzzy c means
- cluster analysis
- information retrieval
- relational clustering
- text mining
- subspace clustering
- hierarchical clustering
- text retrieval
- k means
- keywords
- similarity measure
- spatial clustering
- clustering analysis
- document clustering
- text documents
- dissimilarity measure
- clustering framework
- affinity propagation
- knn
- hierarchical agglomerative clustering
- data structure
- similarity search
- nearest neighbor
- textual data
- document collections
- unsupervised clustering
- unsupervised learning
- fuzzy c means clustering
- information retrieval systems