Keyphrases
- topic discovery
- document clustering
- topic extraction
- k means
- clustering approaches
- clustering algorithm
- clustering method
- document content
- synthetic and real datasets
- text clustering
- topic detection
- document set
- synthetic datasets
- tolerance rough set
- document classification
- web documents
- data clustering
- hierarchical clustering
- high dimensional datasets
- data mining tasks
- document collections
- retrieval systems
- information retrieval systems
- document corpus
- information retrieval
- latent topics
- automatic summarization
- scientific papers
- document level
- latent dirichlet allocation
- document images
- self organizing maps
- keywords
- categorical data
- database
- outlier detection
- benchmark datasets
- high dimensional data
- cluster membership
- text mining
- data mining