An Improved Text Categorization Methodology Based on Second and Third Order Probabilistic Feature Extraction and Neural Network Classifiers.
Dimitrios A. KarrasPublished in: KES (1) (2006)
Keyphrases
- text categorization
- feature extraction
- feature selection
- text classification
- knn
- k nearest neighbor
- information gain
- feature vectors
- automated text categorization
- automatic text categorization
- text documents
- document categorization
- naive bayes
- multi label
- text classifiers
- reuters corpus
- bayesian networks
- feature weighting
- neural network
- term selection
- semi supervised learning
- feature selections
- principal component analysis
- feature space
- image classification
- feature reduction
- feature selection for text categorization
- data sets
- machine learning
- feature selection and classifier
- natural language
- training documents
- knowledge discovery
- document frequency
- support vector machine
- semi supervised
- natural language processing
- term frequency
- text mining
- feature set
- mutual information
- information retrieval systems