The SphereSearch Engine for Unified Ranked Retrieval of Heterogeneous XML and Web Documents.
Jens GraupmannRalf SchenkelGerhard WeikumPublished in: VLDB (2005)
Keyphrases
- web documents
- ranked retrieval
- web search engines
- query expansion
- information extraction
- data collections
- semi structured
- retrieval effectiveness
- web pages
- html documents
- keywords
- link structure
- n gram
- web data
- document representation
- heterogeneous data
- text mining
- databases
- user queries
- similarity search
- information retrieval
- multi dimensional
- xml elements
- exact match
- machine learning