Using chi-square statistics to measure similarities for text categorization.
Yao-Tsung Chen, Meng Chang Chen
Published in: Expert Syst. Appl. (2011)
Keyphrases
- chi square
- text categorization
- information gain
- term frequency
- feature selection
- confidence intervals
- text classification
- knn
- logistic regression
- k nearest neighbor
- similarity measure
- text documents
- naive bayes
- semi supervised learning
- mutual information
- decision trees
- information extraction
- image classification
- prior knowledge
- support vector
- feature set
- nearest neighbor
- information theoretic
- semi supervised
- neural network