Boosting the Performance of Web Spam Detection with Ensemble Under-Sampling Classification.
Guanggang GengChunheng WangQiudan LiLei XuXiao-Bo JinPublished in: FSKD (4) (2007)
Keyphrases
- feature selection
- weak learners
- ensemble classification
- ensemble learning
- ensemble methods
- majority voting
- ensemble classifier
- classification accuracy
- weak classifiers
- decision trees
- multiple classifier systems
- base classifiers
- machine learning
- web spam detection
- training set
- learning algorithm
- support vector machine
- supervised learning
- training data
- benchmark datasets
- spam detection
- feature space
- binary classification problems
- image classification
- generalization ability
- machine learning methods
- topic models
- text classification
- active learning
- training samples
- classification algorithm
- support vector
- keywords