Fast and Effective Text Mining Using Linear-Time Document Clustering.
Bjornar LarsenChinatsu AonePublished in: KDD (1999)
Keyphrases
- document clustering
- text mining
- text clustering
- text documents
- negative matrix factorization
- biomedical literature
- clustering method
- document collections
- vector space model
- clustering algorithm
- document representation
- data mining
- information extraction
- databases
- text data
- machine learning
- knowledge discovery
- k means
- data analysis
- named entities
- information retrieval
- clustering quality
- document clusters
- text classification
- topic models
- tolerance rough set
- topic extraction
- textual data
- document classification
- latent dirichlet allocation
- co occurrence
- pairwise
- multiscale
- real world