Knowledge discovery of multiple-topic document using parametric mixture model with Dirichlet prior.
Issei Sato, Hiroshi Nakagawa
Published in: KDD (2007)
Keyphrases
- mixture model
- Dirichlet prior
- knowledge discovery
- document length
- Dirichlet distribution
- EM algorithm
- Gaussian mixture model
- language model
- probabilistic model
- model selection
- maximum likelihood
- expectation maximization
- information retrieval
- LDA model
- probability density function
- generative model
- unsupervised learning
- retrieval systems
- language modeling
- latent Dirichlet allocation
- object recognition
- vector space model
- document retrieval
- machine learning
- information retrieval systems
- retrieval model
- TF-IDF
- smoothing methods
- text mining
- data mining