More Than Words: Collocation Retokenization for Latent Dirichlet Allocation Models.
Jin CheevaprawatdomrongAlexandra SchofieldAttapol RutherfordPublished in: ACL (Findings) (2022)
Keyphrases
- latent dirichlet allocation
- latent topics
- lda model
- topic models
- probabilistic latent semantic analysis
- probabilistic topic models
- mixed membership
- topic modeling
- co occurrence
- latent topic models
- generative model
- statistical topic models
- text mining
- knowledge discovery
- probabilistic model
- parameter estimation
- model selection
- maximum likelihood
- gibbs sampling
- high dimensional
- feature extraction
- machine learning