Using "Annotator Rationales" to Improve Machine Learning for Text Categorization.
Omar ZaidanJason EisnerChristine D. PiatkoPublished in: HLT-NAACL (2007)
Keyphrases
- text categorization
- text classification
- machine learning
- feature selection
- automatic text categorization
- feature generation
- multi label
- knn
- semi supervised learning
- information gain
- k nearest neighbor
- reuters corpus
- text documents
- automated text categorization
- term frequency
- naive bayes
- decision trees
- unlabeled data
- document categorization
- text classifiers
- feature selection for text categorization
- text collections
- tf idf
- transfer learning
- term weighting
- text mining
- nearest neighbor
- distributional clustering
- information extraction
- data mining