Incorporating domain knowledge into topic modeling via Dirichlet Forest priors.
David AndrzejewskiXiaojin ZhuMark CravenPublished in: ICML (2009)
Keyphrases
- topic modeling
- domain knowledge
- topic models
- latent dirichlet allocation
- dirichlet prior
- hierarchical bayesian model
- prior knowledge
- lda model
- text classification
- text mining
- scientific articles
- topic extraction
- modeling framework
- bayesian framework
- latent topics
- collaborative filtering
- probabilistic latent semantic analysis
- dirichlet distribution
- text corpora
- training data
- feature extraction
- neural network
- mixture model
- generative model
- em algorithm
- prior distribution
- co occurrence
- bayesian networks