Web Page Classification Exploiting Contents of Surrounding Pages for Building a High-Quality Homepage Collection.
Yuxin WangKeizo OyamaPublished in: ICADL (2006)
Keyphrases
- web page classification
- anchor text
- high quality
- web pages
- web search
- test collection
- web mining
- document collections
- web content
- web documents
- search tasks
- automatic classification
- text classification
- query logs
- website
- search engine
- document representation
- database
- web users
- query terms
- metadata
- web graph
- information seeking
- link analysis
- feature selection
- knowledge discovery
- link structure
- multimedia
- machine learning