A Hybrid Document Feature Extraction Method Using Latent Dirichlet Allocation and Word2Vec.
Zhibo Wang, Long Ma, Yanqing Zhang
Published in: DSC (2016)
Keyphrases
- latent topics
- latent dirichlet allocation
- topic models
- word counts
- topic discovery
- lda model
- topic modeling
- statistical topic models
- text documents
- document similarity
- generative model
- probabilistic latent semantic analysis
- text mining
- bag of words
- latent topic models
- gibbs sampling
- latent dirichlet
- topic extraction
- variational bayesian inference
- probabilistic topic models
- latent semantic analysis
- latent variables
- hierarchical bayesian model
- document clustering
- co-occurrence
- support vector
- keywords