Improving Text Categorization Using Domain Knowledge.
Jingbo ZhuWenliang ChenPublished in: NLDB (2005)
Keyphrases
- vector space
- text categorization
- domain knowledge
- external knowledge
- feature selection
- text classification
- knn
- information gain
- multi label
- semi supervised learning
- automated text categorization
- k nearest neighbor
- feature weighting
- text classifiers
- text collections
- text documents
- term weighting
- multi instance multi label learning
- tf idf
- automatic text categorization
- naive bayes
- document categorization
- semantic browsing
- machine learning
- prior knowledge
- comparative evaluation
- semantic information
- reuters corpus
- distributional clustering
- bag of words
- unlabeled data
- feature selections
- training data