Keyphrases
- web documents
- web search engines
- information extraction
- semi structured
- web pages
- document classification
- web data
- keywords
- link structure
- vector space model
- focused crawling
- structured documents
- web content
- structured data
- document representation
- textual information
- web logs
- web search
- search engine
- html documents
- web directories
- database