Unsupervised Document Classification with Informed Topic Models.
Timothy A. Miller, Dmitriy Dligach, Guergana Savova
Published in: BioNLP@ACL (2016)
Keyphrases
- document classification
- topic models
- text documents
- topic modeling
- text mining
- text classification
- latent dirichlet allocation
- probabilistic topic models
- lda model
- latent variables
- text analysis
- text categorization
- news articles
- latent topics
- co-occurrence
- probabilistic model
- semi-supervised
- generative model
- information retrieval
- information extraction
- natural language processing
- gibbs sampling
- document clustering
- machine learning
- classification algorithm
- unsupervised learning
- knowledge discovery
- bag of words
- web documents
- unlabeled data
- labeled data
- object recognition
- bayesian networks
- data mining