Latent Dirichlet Allocation (LDA) for improving the topic modeling of the Official Bulletin of the Spanish State (BOE).
J. C. Bailón-Elvira, Manuel J. Cobo, Enrique Herrera-Viedma, Antonio Gabriel López-Herrera
Published in: ITQM (2019)
Keyphrases
- latent dirichlet allocation
- topic modeling
- topic models
- generative model
- latent topics
- lda model
- topic discovery
- topic extraction
- probabilistic latent semantic analysis
- text mining
- gibbs sampling
- probabilistic topic models
- data mining
- collapsed gibbs sampling
- co-occurrence
- variational bayesian inference
- latent semantic analysis
- text classification
- dimensionality reduction
- text corpora
- latent variables
- text documents
- latent variable models
- image representation
- pattern recognition
- bayesian networks
- latent topic models
- training data