Variance based classifier comparison in text categorization.
Atsuhiro TakasuKenro AiharaPublished in: SIGIR (2000)
Keyphrases
- text categorization
- text classifiers
- feature selection
- feature weighting
- training documents
- text classification
- feature selection and classifier
- multi label
- feature reduction
- knn
- classify documents
- multi label classification
- information gain
- document classification
- text documents
- naive bayes
- k nearest neighbor
- semi supervised learning
- reuters corpus
- classification algorithm
- feature space
- training data
- automatic text categorization
- automated text categorization
- linear svm
- term frequency
- decision trees
- neural network
- document frequency
- correlation coefficient
- unlabeled data
- feature set
- dimensionality reduction
- training set
- tf idf
- natural language processing
- data analysis
- support vector
- data sets
- feature selections