A Novel Distributed K-Means Clustering Algorithm for Big Text Data.
Min Li, Meijing Li, Yonglong Cheng, Keun Ho Ryu
Published in: HP3C (2023)
Keyphrases
- text data
- text mining
- text classification
- topic hierarchies
- text documents
- high dimensional
- structured data
- document collections
- clustering algorithm
- high dimensional data
- co-occurrence
- text categorization
- low dimensional
- XML documents
- web pages
- database
- information extraction
- knowledge discovery
- pattern recognition
- data sets