Text Classification from Labeled and Unlabeled Documents using EM.
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom M. Mitchell
Published in: Mach. Learn. (2000)
Keyphrases
- unlabeled documents
- text classification
- labeled documents
- unsupervised learning
- text categorization
- text classifiers
- feature selection
- bag of words
- expectation maximization
- text mining
- n-gram
- machine learning
- naive bayes
- em algorithm
- training documents
- text documents
- labeled data
- unlabeled data
- document classification
- maximum likelihood
- generative model
- text data
- multi-label
- naive bayes classifier
- term frequency
- co-training
- feature vectors
- feature space
- probabilistic model
- semi-supervised learning
- semantic features
- high dimensional