Improving Farsi multiclass text classification using a thesaurus and two-stage feature selection.
Nooshin Maghsoodi, Mohammad Mehdi Homayounpour
Published in: J. Assoc. Inf. Sci. Technol. (2011)
Keyphrases
- multi class
- text classification
- feature selection
- text categorization
- naive bayes
- support vector machine
- labeled data
- multi class classification
- multiclass classification
- classification accuracy
- high dimensionality
- cost sensitive
- binary classification
- multi task
- multiclass problems
- multi label
- knn
- feature set
- k nearest neighbor
- information retrieval
- machine learning
- feature selection algorithms
- multiple classes
- binary classifiers
- error correcting output codes
- multiclass support vector machines
- dimensionality reduction
- natural language processing
- feature space
- unlabeled data
- text mining
- pairwise
- perceptron algorithm
- feature reduction
- data sets
- transfer learning
- feature subset
- model selection
- binary classification problems
- support vector
- feature extraction
- decision trees