Web spam filtering in internet archives.
Miklós Erdélyi, András A. Benczúr, Julien Masanès, Dávid Siklósi
Published in: AIRWeb (2009)
Keyphrases
- spam filtering
- internet users
- web technologies
- internet archive
- website
- world wide web
- machine learning models
- world wide
- semantic web
- spam detection
- text classification
- web applications
- information overload
- user generated content
- web information
- information delivery
- spam filters
- anti spam
- internet usage
- web documents
- digital libraries
- increasing rapidly
- web browsing
- digital archives
- social networking sites
- business information
- online resources
- constantly growing
- web portal
- document repositories
- internet services
- link analysis
- information providers
- web users
- linked data
- digital world
- web content
- web pages
- Tim Berners-Lee
- social networking websites
- internet enabled
- data formats
- cultural heritage
- text mining
- social media
- metadata