Web spam detection: new classification features based on qualified link analysis and language models.
Lourdes Araujo, Juan Martínez-Romo
Published in: IEEE Trans. Inf. Forensics Secur. (2010)
Keyphrases
- language model
- link analysis
- web spam detection
- classification accuracy
- web spam
- feature vectors
- spam detection
- language modeling
- feature set
- classification models
- feature space
- n-gram
- web graph
- retrieval model
- probabilistic model
- web mining
- information retrieval
- query expansion
- machine learning
- ranking algorithm
- web pages
- test collection
- document retrieval
- text mining
- cost-sensitive
- text classification
- web search
- information retrieval systems
- collaborative filtering
- supervised learning
- community detection
- vector space
- graph mining
- training data
- link structure
- relevance model
- decision trees