Inference and Evaluation of the Multinomial Mixture Model for Text Clustering
Loïs RigousteOlivier CappéFrançois YvonPublished in: CoRR (2006)
Keyphrases
- mixture model
- text clustering
- em algorithm
- probabilistic model
- expectation maximization
- gaussian mixture model
- dirichlet distribution
- generative model
- text data
- maximum likelihood
- text categorization
- language model
- probability density function
- k means
- text documents
- text mining
- density estimation
- text classification
- hierarchical clustering
- document clustering
- unsupervised learning
- model selection
- user feedback
- bayesian networks
- wordnet
- information retrieval
- knowledge discovery
- metric learning
- training set
- document representation
- image segmentation