Evaluating Hard and Soft Flat-Clustering Algorithms for Text Documents.
Vivek Kumar SinghTanveer J. SiddiquiManoj Kumar SinghPublished in: IHCI (2011)
Keyphrases
- text documents
- document clustering
- clustering algorithm
- text mining
- text classification
- text categorization
- text clustering
- wordnet
- information extraction
- keywords
- topic models
- news articles
- bag of words
- document classification
- text data
- text collections
- k means
- information retrieval
- named entities
- data sets
- automatic text categorization
- learning algorithm
- neural network
- language model
- knn
- probabilistic model
- clustering quality
- relevant concepts