Improving Reliability of Latent Dirichlet Allocation by Assessing Its Stability Using Clustering Techniques on Replicated Runs.
Jonas RiegerLars KoppersCarsten JentschJörg RahnenführerPublished in: CoRR (2020)
Keyphrases
- latent dirichlet allocation
- topic discovery
- topic models
- probabilistic latent semantic analysis
- probabilistic latent semantic indexing
- lda model
- topic modeling
- topic extraction
- latent topics
- generative model
- tag information
- clustering algorithm
- clustering method
- gibbs sampling
- variational inference
- variational bayesian inference
- latent topic models
- text mining
- data points
- text analysis
- co occurrence
- latent variables
- nonnegative matrix factorization
- document clustering
- cluster analysis
- text classification
- hierarchical bayesian model
- k means
- spectral clustering
- text documents
- em algorithm
- dimensionality reduction