Set Cover Feature Selection for Text Categorisation and spam detection.
Elías F. CombarroJosé RanillaManuel Roberto BerdascoElena MontañésIrene DíazPublished in: Int. J. Adv. Intell. Paradigms (2009)
Keyphrases
- spam detection
- text categorisation
- set cover
- feature selection
- greedy algorithm
- approximation algorithms
- np hard
- network flow
- principal components analysis
- active learning
- solution space
- spam filtering
- greedy heuristic
- text classification
- text categorization
- text mining
- support vector
- feature extraction
- k nearest neighbour
- fraud detection
- dimensionality reduction
- multi class
- support vector machine
- text classifiers
- special case
- machine learning
- classification accuracy
- convex hull
- data sets
- information gain
- web graph
- knn
- feature space
- objective function
- decision trees
- neural network