Identifying Spam Web Pages Based on Content Similarity.
Maria Soledad PeraYiu-Kai NgPublished in: ICCSA (2) (2008)
Keyphrases
- content similarity
- web documents
- web pages
- web spam
- web spam detection
- spam detection
- similarity metric
- search engine
- adversarial information retrieval
- keywords
- web search
- web search engines
- link analysis
- link structure
- vector space model
- pairwise
- information retrieval
- data objects
- data mining
- information retrieval systems