Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations.
Yu MengYunyi ZhangJiaxin HuangYu ZhangJiawei HanPublished in: WWW (2022)
Keyphrases
- language model
- topic discovery
- latent space
- topic models
- probabilistic model
- latent variables
- n gram
- latent dirichlet allocation
- document retrieval
- information retrieval
- text classification
- retrieval model
- probabilistic latent semantic analysis
- mixture model
- query expansion
- low dimensional
- clustering algorithm
- generative model
- gaussian process
- distance metric
- data analysis
- k means