An FW-BF Based Approach on Elimination of Duplicated Web Pages.
Leiming MaZhengyou XiaPublished in: IDEAL (2016)
Keyphrases
- web pages
- website
- search engine
- web page classification
- keywords
- web search engines
- web content mining
- link analysis
- link structure
- google search engine
- web data
- web data extraction
- page segmentation
- web search
- data records
- machine learning
- dynamic content
- data extraction
- deep web
- web mining
- web users
- hierarchical structure
- textual content
- web graph
- helping users
- dynamically generated
- data sets
- web content
- web server