Techniques for Improving the Performance of Naive Bayes for Text Classification.
Karl-Michael SchneiderPublished in: CICLing (2005)
Keyphrases
- text classification
- naive bayes
- text categorization
- logistic regression
- naive bayes classifier
- bag of words
- text mining
- text documents
- text classifiers
- feature selection
- uci data sets
- bayesian classifier
- uci datasets
- machine learning
- text data
- cost sensitive
- naive bayes classification
- classification algorithm
- multi label
- decision trees
- test instances
- naive bayesian classifier
- probability estimation
- labeled data
- unsupervised learning
- bayesian network classifiers
- document classification
- unlabeled data
- data sets
- probabilistic classifiers
- locally weighted
- independence assumption
- knn
- co training
- term frequency
- base classifiers
- active learning
- semantic features
- conditional independence assumption