Divergence-based feature selection for naïve Bayes text classification.
Huizhen WangJingbo ZhuKeh-Yih SuPublished in: NLPKE (2008)
Keyphrases
- support vector machine
- text classification
- feature selection
- bayes classifiers
- bayes classifier
- tree augmented
- naive bayes classifier
- multi class
- text categorization
- naive bayes
- jensen shannon divergence
- bayesian classifiers
- support vector
- machine learning
- bag of words
- feature selection algorithms
- knn
- classification accuracy
- mutual information
- classification models
- text documents
- text classifiers
- n gram
- k nearest neighbor
- feature space
- dimensionality reduction
- unsupervised learning
- labeled data
- feature engineering
- feature weighting
- feature extraction
- bayesian network classifiers
- multi label
- relative entropy
- feature set
- term frequency
- unlabeled data
- supervised feature selection
- data cleaning
- bayesian classifier
- microarray
- semi supervised learning
- text mining
- high dimensional
- bayesian networks
- information retrieval