Enhanced Topic Distillation Using Text, Markup Tags, and Hyperlinks.
Soumen Chakrabarti, Mukul Joshi, Vivek Tawde
Published in: SIGIR (2001)
Keyphrases
- topic distillation
- content and structure
- content features
- information retrieval
- web pages
- xml documents
- keywords
- website
- semi-structured
- xml retrieval
- semantic information
- anchor text
- web documents
- text documents
- text retrieval
- query independent
- text mining
- link structure
- world wide web
- relational databases
- part of speech
- language model
- web search