Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations.
Yu MengYunyi ZhangJiaxin HuangYu ZhangJiawei HanPublished in: CoRR (2022)
Keyphrases
- language model
- topic discovery
- latent space
- topic models
- probabilistic model
- latent variables
- n gram
- latent dirichlet allocation
- text classification
- document retrieval
- probabilistic latent semantic analysis
- generative model
- query expansion
- information retrieval
- retrieval model
- mixture model
- unsupervised learning
- dimensionality reduction
- k means
- clustering algorithm
- distance metric
- low dimensional
- lda model
- text mining
- document clustering
- high dimensional data