Performance thresholding in practical text classification.
Hinrich SchützeEmre VelipasaogluJan O. PedersenPublished in: CIKM (2006)
Keyphrases
- text classification
- bag of words
- text categorization
- naive bayes
- real world
- machine learning
- feature selection
- knn
- text classifiers
- image segmentation
- practical application
- multi label
- information retrieval
- support vector
- text documents
- data cleaning
- labeled data
- text data
- thresholding algorithm
- data sets
- n gram
- gray level
- unlabeled data
- co occurrence
- supervised learning
- data analysis
- image processing
- neural network
- databases