Fast and Simple Deterministic Seeding of KMeans for Text Document Clustering.
Ehsan Sherkat, Julien Velcin, Evangelos E. Milios
Published in: CLEF (2018)
Keyphrases
- document clustering
- text clustering
- text mining
- text documents
- k means
- automatic categorization
- clustering algorithm
- clustering quality
- document categorization
- document representation
- topic detection
- document corpus
- document collections
- vector space model
- automatic summarization
- non-negative matrix factorization
- document clusters
- keywords
- tolerance rough set
- artificial intelligence
- text data
- text retrieval
- clustering method
- text collections
- pairwise constraints
- wordnet
- text classification
- information extraction
- information retrieval
- bisecting k means