A TNATS Approcah to Hidden Web Documents.
Yih-Ling HedleyMuhammad YounasAnne E. JamesPublished in: ICDCIT (2004)
Keyphrases
- web documents
- information extraction
- web search engines
- semi structured
- document classification
- keywords
- web data
- web pages
- html documents
- document representation
- vector space model
- focused crawling
- link structure
- web content
- web logs
- content similarity
- unstructured documents
- geographic information
- structured documents
- textual information
- query processing
- website