PLDA+: Parallel latent Dirichlet allocation with data placement and pipeline processing.
Zhiyuan Liu, Yuzhou Zhang, Edward Y. Chang, Maosong Sun
Published in: ACM Trans. Intell. Syst. Technol. (2011)
Keyphrases
- latent dirichlet allocation
- data placement
- topic models
- topic modeling
- high availability
- parallel processing
- generative model
- lda model
- query optimization
- data center
- distributed environment
- access patterns
- text mining
- gibbs sampling
- data storage
- data partitioning
- latent variables
- range queries
- distributed database systems
- xml queries
- distributed databases
- data mining
- power consumption
- dimensionality reduction
- response time
- natural language processing
- high dimensional
- database systems
- artificial intelligence