Using Bag-of-Concepts to Improve the Performance of Support Vector Machines in Text Categorization.
Magnus SahlgrenRickard CösterPublished in: COLING (2004)
Keyphrases
- text categorization
- feature selection
- support vector
- text classification
- external knowledge
- knn
- reuters corpus
- document categorization
- information gain
- multi label
- feature weighting
- k nearest neighbor
- naive bayes
- linear svm
- text documents
- training documents
- automatic text categorization
- semi supervised learning
- data sets
- bag of words
- classification accuracy
- tf idf
- term frequency
- pairwise
- feature selection for text categorization
- svm classifier
- multi instance multi label learning
- automated text categorization
- term weighting
- kernel function
- machine learning
- training examples
- knowledge discovery
- active learning
- neural network