Document clustering via dirichlet process mixture model with feature selection.
Guan YuRui-zhang HuangZhaojun WangPublished in: KDD (2010)
Keyphrases
- image segmentation
- document clustering
- feature selection
- dirichlet process mixture models
- text documents
- text mining
- clustering algorithm
- document collections
- clustering method
- text categorization
- text classification
- bayesian model
- tf idf
- support vector
- missing data
- document representation
- generative model
- machine learning
- k means
- vector space model
- dictionary learning
- feature extraction
- k nearest neighbor
- feature space
- data mining
- knn
- knowledge discovery
- object recognition