New Methods for Text Categorization Based on a New Feature Selection Method and a New Similarity Measure Between Documents.
Li-Wei LeeShyi-Ming ChenPublished in: IEA/AIE (2006)
Keyphrases
- text categorization
- document frequency
- feature selection
- similarity measure
- text classifiers
- feature weighting
- mutual information
- text classification
- text documents
- term frequency
- document classification
- term selection
- feature selection for text categorization
- term weighting
- clustering method
- feature set
- pairwise
- word frequency
- naive bayes
- information gain
- feature selection and classifier
- automatic text categorization
- semi supervised learning
- feature reduction
- classification accuracy
- knn
- support vector machine
- feature selections
- k nearest neighbor
- transductive support vector machine
- linear svm
- training documents
- document categorization
- information retrieval
- feature space
- text collections
- document representation
- automated text categorization
- tf idf
- feature subset
- data sets