Document clustering using dirichlet process mixture model of von Mises-Fisher distributions.
Nguyen Kim AnhNguyen The TamNgo Van LinhPublished in: SoICT (2013)
Keyphrases
- document clustering
- von mises fisher
- unit sphere
- dirichlet process mixture models
- mixture model
- multi variate
- generative model
- text mining
- clustering method
- document collections
- document representation
- clustering algorithm
- text documents
- dirichlet process
- tf idf
- k means
- em algorithm
- probabilistic model
- model selection
- random variables
- clustering quality
- multiscale
- data analysis
- gaussian distribution
- supervised learning
- cluster analysis
- expectation maximization