Text Categorization Using Modified-CHI Feature Selection and Document/Term Frequencies.
Zhaohui ZhengSargur N. SrihariPublished in: ICMLA (2002)
Keyphrases
- text categorization
- term frequency
- feature selection
- information gain
- text classification
- document frequency
- text documents
- tf idf
- term weighting
- document classification
- automatic text categorization
- knn
- naive bayes
- text classifiers
- term weights
- k nearest neighbor
- feature set
- feature extraction
- unsupervised learning
- semi supervised learning
- mutual information
- machine learning
- word frequency
- data mining
- retrieved documents
- keywords
- support vector
- text data
- feature space
- support vector machine