Analysing the Impact of Removing Infrequent Words on Topic Quality in LDA Models.
Victor BystrovViktoriia Naboka-KrellAnna Staszewska-BystrovaPeter WinkerPublished in: CoRR (2023)
Keyphrases
- latent topics
- latent dirichlet allocation
- statistical topic models
- topic models
- probabilistic topic models
- lda model
- high quality
- probabilistic model
- model selection
- parameter estimation
- topic modeling
- statistical models
- latent variable models
- linear discriminant analysis
- computer vision
- information retrieval
- neural network
- generative model
- keywords
- topic discovery