Representation Quality in Text Classification: An Introduction and Experiment.
David D. Lewis
Published in: HLT (1990)
Keyphrases
- text classification
- bag of words
- high quality
- n-gram
- naive bayes
- feature selection
- machine learning
- decision trees
- text mining
- image representation
- databases
- sentiment analysis
- data quality
- multi-label
- labeled data
- text data
- higher quality
- text classifiers
- text categorization
- database
- knowledge discovery
- feature vectors
- knowledge base
- data sets