Enhance Web Pages Genre Identification Using Neighboring Pages.
Jia ZhuXiaofang ZhouGabriel Pui Cheong FungPublished in: WISE (2011)
Keyphrases
- web pages
- web documents
- website
- search engine
- web search
- link structure
- link analysis
- anchor text
- dynamically generated
- web users
- keywords
- dynamic content
- hyperlink structure
- web graph
- web data
- web server
- web information extraction
- web content
- page content
- html pages
- web spam
- web search engines
- browsing behavior
- content features
- data extraction
- web content mining
- focused crawling
- web page classification
- textual content
- web objects
- web crawler
- web crawlers
- semi structured
- web usage mining
- web browser
- related web pages
- dom tree