Detecting Themes in Web Document Descriptors.
David K. Y. ChiuDavid R. BrooksPublished in: WebNet (1997)
Keyphrases
- web documents
- information extraction
- web pages
- semi structured
- prefetching
- keywords
- web search engines
- web logs
- web content
- html documents
- textual information
- web data
- keypoints
- image descriptors
- shape descriptors
- image retrieval
- content similarity
- search engine
- query logs
- feature descriptors
- text classification
- vector space model
- document representation
- natural language processing
- knowledge discovery