Heuristic Ranking and Diversification of Web Documents.
Jiyin HeKrisztian BalogKatja HofmannEdgar MeijMaarten de RijkeManos TsagkiasWouter WeerkampPublished in: TREC (2009)
Keyphrases
- web documents
- tabu search
- content similarity
- information extraction
- web search engines
- semi structured
- semantic association
- search result diversification
- ranking algorithm
- ranking functions
- vector space model
- web pages
- simulated annealing
- federated search
- html documents
- keywords
- link structure
- textual information
- social annotations
- web search
- web data
- web content
- document representation
- user feedback
- ranked list
- focused crawling