Ranking and selecting terms for text categorization via SVM discriminate boundary.
Tien-Fang KuoYasutoshi YajimaPublished in: Int. J. Intell. Syst. (2010)
Keyphrases
- text categorization
- knn
- feature selection
- text classifiers
- k nearest neighbor
- term frequency
- text collections
- text classification
- term selection
- multi label
- support vector
- term weighting
- linear svm
- automatic text categorization
- reuters corpus
- document frequency
- information gain
- support vector machine svm
- distributional clustering
- automated text categorization
- feature reduction
- naive bayes
- nearest neighbor
- support vector machine
- ranking algorithm
- text documents
- feature weighting
- tf idf
- machine learning
- learning to rank
- unsupervised learning
- information retrieval systems
- web search
- classification accuracy
- training set
- feature selections