TNO-UT at TREC-9: How Different are Web Documents?
Wessel Kraaij, Thijs Westerveld
Published in: TREC (2000)
Keyphrases
- web documents
- information retrieval
- test collection
- information extraction
- web search engines
- question answering
- retrieval effectiveness
- web pages
- ad hoc retrieval
- semi-structured
- keywords
- relevance judgments
- link structure
- relevance feedback
- pseudo relevance feedback
- vector space model
- web content
- structured documents
- web data
- document representation
- html documents
- query expansion
- focused crawling
- dynamically generated
- speaker diarization