RSTIndex : Indexing and Retrieving Web Document Using Computational and Linguistic Techniques.
Farhi MarirKamel HouamPublished in: IDEAL (2002)
Keyphrases
- web documents
- information retrieval
- information extraction
- semi structured
- web pages
- efficient retrieval
- database
- web data
- prefetching
- web logs
- web search engines
- web content
- text retrieval
- unstructured documents
- focused crawling
- html documents
- textual information
- document representation
- natural language
- indexing method
- multimedia databases
- natural language processing
- dynamically generated
- efficiently retrieve