ZenLDA: Large-scale topic model training on distributed data-parallel platform.
Bo ZhaoHucheng ZhouGuoqiang LiYihua HuangPublished in: Big Data Min. Anal. (2018)
Keyphrases
- topic models
- distributed data
- latent dirichlet allocation
- topic modeling
- data sharing
- data distribution
- generative model
- latent variables
- text mining
- co occurrence
- latent topics
- probabilistic model
- communication cost
- file system
- latent topic model
- real world
- databases
- probabilistic topic models
- statistical topic models
- data mining algorithms
- training set
- peer to peer
- structured prediction
- data streams
- pattern recognition
- database systems
- database