Exploiting the value of class labels on high-dimensional feature spaces: topic models for semi-supervised document classification.
Hossein SoleimaniDavid J. MillerPublished in: Pattern Anal. Appl. (2019)
Keyphrases
- document classification
- class labels
- topic models
- semi supervised
- labeled data
- text documents
- classification algorithm
- unlabeled data
- text mining
- supervised learning
- text classification
- generative model
- semi supervised learning
- text categorization
- topic modeling
- latent dirichlet allocation
- training data
- active learning
- input space
- latent variables
- co training
- co occurrence
- training examples
- pairwise
- probabilistic model
- unsupervised learning
- multi label
- markov networks
- natural language processing
- learning algorithm
- transfer learning
- training set
- web documents
- feature set
- information extraction
- prior knowledge
- document clustering
- image processing
- high dimensional
- model selection
- data points
- information retrieval
- knowledge discovery
- machine learning