Text Categorization with All Substring Features.
Daisuke OkanoharaJun'ichi TsujiiPublished in: SDM (2009)
Keyphrases
- text categorization
- feature generation
- feature weighting
- text classification
- information gain
- knn
- feature selection
- k nearest neighbor
- multi label
- reuters corpus
- feature set
- feature reduction
- linear svm
- text classifiers
- semi supervised learning
- training documents
- feature selection for text categorization
- text documents
- neural network
- naive bayes
- feature vectors
- term frequency
- term weighting
- information retrieval systems
- document categorization
- image features
- document frequency
- prior knowledge
- automated text categorization
- information retrieval