Feature Selection for Document Classifier for IT documents based on SVM.
YunHee Kang
Published in: International Conference on Internet Computing (2002)
Keyphrases
- text classifiers
- feature selection
- text categorization
- training documents
- text classification
- document classification
- classify documents
- text documents
- support vector
- document collections
- support vector machine
- naive bayes
- term frequency
- relevant documents
- document retrieval
- svm classifier
- document clustering
- knn
- electronic documents
- information retrieval
- document content
- web documents
- feature space
- document analysis
- information retrieval systems
- document representation
- feature set
- similar documents
- digital documents
- vector space model
- document ranking
- classification accuracy
- retrieval systems
- bayes classifier
- document type
- feature ranking
- document repository
- multi class
- feature selection algorithms
- classification method
- machine learning
- classification performances
- document frequency
- k nearest neighbor
- input features
- document set
- feature subset
- related documents
- tf idf
- fold cross validation
- keywords
- feature extraction
- retrieved documents
- selected features
- text mining
- decision trees