Unsupervised language filtering using the latent dirichlet allocation.
Wei ZhangRobert A. J. ClarkYongyuan WangPublished in: INTERSPEECH (2014)
Keyphrases
- latent dirichlet allocation
- topic modeling
- topic models
- topic discovery
- lda model
- probabilistic topic models
- generative model
- text mining
- hierarchical bayesian models
- probabilistic latent semantic analysis
- gibbs sampling
- variational bayesian inference
- latent topics
- semi supervised
- hierarchical bayesian model
- natural language
- text classification
- supervised learning
- latent topic models
- variational inference
- unsupervised learning
- word counts
- pattern recognition
- databases
- latent topic model
- machine learning
- information retrieval
- support vector
- text corpora
- bag of words