A three-phase approach to document clustering based on topic significance degree.
Yinglong MaYao WangBeihong JinPublished in: Expert Syst. Appl. (2014)
Keyphrases
- tolerance rough set
- document content
- document set
- topic discovery
- topic hierarchy
- document collections
- document images
- relevant documents
- textual content
- document level
- automatic summarization
- text documents
- document corpus
- topic models
- information retrieval systems
- cross document
- information retrieval
- single document summarization
- latent topics
- document classification
- document clustering
- document retrieval
- topic detection
- focused crawler
- user queries
- multi document summarization
- document representation
- topic specific
- expert finding
- tf idf
- retrieval systems
- document summaries
- semantic information
- web documents