A document clustering algorithm for discovering and describing topics.
Henry Anaya-SánchezAurora Pons-PorrataRafael Berlanga LlavoriPublished in: Pattern Recognit. Lett. (2010)
Keyphrases
- clustering algorithm
- document clustering
- text documents
- tolerance rough set
- information retrieval
- latent topics
- topic discovery
- topic detection
- keywords
- text clustering
- document set
- topic models
- document clusters
- document images
- document collections
- statistical topic models
- text collections
- document classification
- technical papers
- k means
- topic hierarchy
- document corpus
- retrieval systems
- latent dirichlet allocation
- information retrieval systems
- data clustering
- document representation
- relevant documents
- blog posts
- user queries
- related topics
- fuzzy clustering
- fuzzy c means
- document analysis
- clustering quality
- web documents
- clustering method
- wikipedia pages
- topic modeling