Text categorization based on k-nearest neighbor approach for Web site classification.
Oh-Woog KwonJong-Hyeok LeePublished in: Inf. Process. Manag. (2003)
Keyphrases
- text categorization
- k nearest neighbor
- knn
- text classification
- document classification
- classification algorithm
- support vector machine svm
- k nearest neighbour
- knn classifier
- website
- knn algorithm
- support vector machine
- feature selection
- nearest neighbor
- reuters corpus
- text classifiers
- feature reduction
- automated text categorization
- document categorization
- information gain
- automatic text categorization
- text documents
- naive bayes
- transductive support vector machine
- multi label
- classification accuracy
- pattern recognition
- training documents
- term frequency
- web pages
- feature weighting
- nearest neighbour
- semi supervised learning
- machine learning
- decision trees
- feature space
- tf idf
- image classification
- training set
- pattern classification
- feature extraction
- unlabeled data
- prior knowledge
- query processing
- machine learning methods
- support vector
- multi class
- supervised learning
- neural network
- data sets