ZenLDA: An Efficient and Scalable Topic Model Training System on Distributed Data-Parallel Platform.
Bo Zhao, Hucheng Zhou, Guoqiang Li, Yihua Huang
Published in: CoRR (2015)
Keyphrases
- topic models
- distributed data
- latent Dirichlet allocation
- data sharing
- topic modeling
- data distribution
- communication cost
- text mining
- latent topics
- probabilistic model
- generative model
- databases
- co-occurrence
- latent variables
- latent topic model
- data mining algorithms
- information retrieval
- probabilistic topic models
- peer-to-peer
- structured prediction
- machine learning