Combining Text and Heuristics for Cost-Sensitive Spam Filtering.
José María Gómez Hidalgo, Manuel Maña López, Enrique Puertas Sanz
Published in: CoNLL/LLL (2000)
Keyphrases
- cost sensitive
- spam filtering
- multi class
- cost sensitive learning
- misclassification costs
- text classification
- naive bayes
- binary classification
- cost sensitive classification
- active learning
- fraud detection
- anti spam
- class distribution
- spam filters
- spam detection
- confidence weighted
- class imbalance
- information retrieval
- boosting algorithms
- text mining
- multi label
- high dimensional data
- training examples
- text categorization
- rule extraction
- supervised learning
- training set
- training data
- similarity measure
- artificial intelligence
- data sets