An improved K-nearest-neighbor algorithm for text categorization.
Shengyi Jiang, Guansong Pang, Meiling Wu, Limin Kuang
Published in: Expert Syst. Appl. (2012)
Keyphrases
- pairwise
- text categorization
- knn
- k nearest neighbor
- feature selection
- multi label
- text classification
- nearest neighbor
- automated text categorization
- naive bayes
- feature weighting
- text documents
- reuters corpus
- information gain
- document categorization
- document classification
- neural network
- automatic text categorization
- support vector machine
- tf idf
- term frequency
- text collections
- feature selections
- term weighting
- semi supervised learning
- mutual information
- information retrieval
- machine learning
- data mining