Efficient Update of Indexes for Dynamically Changing Web Documents.
Lipyeow LimMin WangSriram PadmanabhanJeffrey Scott VitterRamesh C. AgarwalPublished in: World Wide Web (2007)
Keyphrases
- web documents
- dynamically changing
- semi structured
- web pages
- information extraction
- keywords
- focused crawling
- document classification
- web content
- html documents
- database
- web search engines
- textual information
- databases
- vector space model
- structured documents
- link structure
- topic specific
- query processing
- content similarity