A Dirichlet multinomial mixture model-based approach for short text clustering.
Jianhua Yin, Jianyong Wang
Published in: KDD (2014)
Keyphrases
- mixture model
- short text
- Dirichlet distribution
- EM algorithm
- Dirichlet prior
- expectation maximization
- probabilistic model
- mixture modeling
- short text classification
- Gaussian mixture model
- density estimation
- topic detection
- unsupervised learning
- maximum likelihood
- generative model
- text classification
- language model
- k-means
- discrete data
- clustering algorithm
- probabilistic mixture model
- information retrieval
- feature selection
- data points
- latent variables
- probability density function
- document collections
- model selection