Topic-Based Hard Clustering of Documents Using Generative Models.
Giovanni PontiAndrea TagarelliPublished in: ISMIS (2009)
Keyphrases
- generative model
- topic models
- latent dirichlet allocation
- document clustering
- topic modeling
- probabilistic model
- text documents
- mixture model
- hierarchical hidden markov models
- lda model
- clustering method
- em algorithm
- document collections
- information retrieval systems
- discriminative learning
- clustering algorithm
- discriminative models
- information retrieval
- generative process
- prior knowledge
- conditional random fields
- k means
- unsupervised learning
- spectral clustering
- document classification
- web documents
- document retrieval
- information extraction
- object categories
- semi supervised
- expectation maximization
- semi supervised learning
- generative and discriminative models
- representational power