On strategies for imbalanced text classification using SVM: A comparative study.
Aixin SunEe-Peng LimYing LiuPublished in: Decis. Support Syst. (2009)
Keyphrases
- text classification
- text classifiers
- feature selection
- knn
- classification method
- support vector machine svm
- machine learning
- text classification tasks
- support vector
- bag of words
- text categorization
- text mining
- k nearest neighbor
- naive bayes
- imbalanced datasets
- imbalanced data
- distributional clustering
- support vector machine
- svm classifier
- feature reduction
- multi label
- cost sensitive
- labeled data
- feature vectors
- kernel methods
- document classification
- text data
- class imbalance
- binary classification
- multi class
- online auctions
- active learning
- cost sensitive learning
- data cleaning
- natural language processing
- kernel function
- unlabeled data
- n gram