Distributed implementation of the latent Dirichlet allocation on Spark.
Karim SayadiQuang Vu BuiMarc BuiPublished in: SoICT (2016)
Keyphrases
- latent dirichlet allocation
- topic models
- topic modeling
- generative model
- latent topics
- variational bayesian inference
- distributed systems
- probabilistic latent semantic indexing
- probabilistic latent semantic analysis
- variational inference
- topic discovery
- text mining
- latent topic models
- lda model
- probabilistic topic models
- information retrieval
- hierarchical bayesian model
- feature selection
- latent topic model
- word counts
- gibbs sampling
- information extraction
- latent variables