A parallel text document clustering algorithm based on neighbors.
Yanjun LiCongnan LuoSoon Myoung ChungPublished in: Clust. Comput. (2015)
Keyphrases
- text documents
- clustering algorithm
- document clustering
- text mining
- text classification
- topic models
- text categorization
- keywords
- k means
- information extraction
- nearest neighbor
- text data
- document representation
- wordnet
- named entities
- document classification
- textual information
- text databases
- databases
- bag of words
- clustering method
- information retrieval
- text collections
- supervised learning
- knowledge discovery
- prior knowledge
- feature extraction
- face recognition
- data sets
- automatic text categorization