Semi-Supervised Latent Dirichlet Allocation and Its Application for Document Classification.
Di Wang, Marcus Thint, Ahmad Al-Rubaie
Published in: Web Intelligence/IAT Workshops (2012)
Keyphrases
- document classification
- latent Dirichlet allocation
- semi-supervised
- text mining
- generative model
- topic extraction
- topic models
- text documents
- LDA model
- text categorization
- semi-supervised learning
- topic modeling
- text classification
- unlabeled data
- labeled data
- classification algorithm
- pairwise
- active learning
- supervised learning
- web documents
- co-occurrence
- Gibbs sampling
- unsupervised learning
- information extraction
- discriminative learning
- word alignment
- machine learning
- data analysis
- natural language processing
- feature selection
- data points
- probabilistic model
- named entities
- information retrieval
- artificial intelligence
- data mining
- training data
- databases