On Compression-Based Text Classification.
Yuval MartonNing WuLisa HellersteinPublished in: ECIR (2005)
Keyphrases
- text classification
- bag of words
- feature selection
- image compression
- text categorization
- document classification
- data compression
- n gram
- labeled data
- machine learning
- text mining
- compression ratio
- compression algorithm
- text data
- compression scheme
- unlabeled data
- naive bayes
- sentiment analysis
- knn
- semantic features
- text classifiers
- multi label
- text documents
- neural network
- compression rate
- feature extraction
- decision trees
- image classification
- data cleaning