The Research of Web Page De-duplication Based on Web Pages Reshipment Statement.
Min-yan Wang, Dong-sheng Liu
Published in: DBTA (2009)
Keyphrases
- web pages
- website
- web page classification
- search engine
- web documents
- web search
- web content
- keywords
- web search engines
- link analysis
- web server
- text classification
- data extraction
- textual contents
- web users
- page segmentation
- content features
- social bookmarking
- web databases
- web communities
- web spam
- web browser
- social networks