Web-based text classification in the absence of manually labeled training documents.
Chen-Ming Hung, Lee-Feng Chien
Published in: J. Assoc. Inf. Sci. Technol. (2007)
Keyphrases
- training documents
- manually labeled
- text classification
- text categorization
- ground truth
- unlabeled documents
- text classifiers
- text documents
- training data
- machine learning
- labeled documents
- naive bayes
- feature selection
- n gram
- vector space
- bag of words
- labeled data
- text data
- document classification
- unlabeled data
- text mining
- class labels
- textual data
- multimedia
- training corpus
- knn
- video dataset
- multi label
- term frequency
- semantic features
- high quality
- classification accuracy
- action recognition
- language model
- information retrieval
- data mining