Spam web page detection using combined content and link features.
Rajendra Kumar RoulShubham Rohan AsthanaGaurav KumarPublished in: Int. J. Data Min. Model. Manag. (2016)
Keyphrases
- false positives
- feature extraction
- spam detection
- web pages
- feature space
- web documents
- search engine
- website
- low level
- spam classification
- content features
- textual features
- web resources
- web content
- detection algorithm
- object detection
- feature vectors
- visual features
- feature set
- semantic information
- co occurrence
- keypoints
- web graph
- keywords
- multimedia
- page segmentation
- metadata