Indexes for highly repetitive document collections.
Francisco ClaudeAntonio FariñaMiguel A. Martínez-PrietoGonzalo NavarroPublished in: CIKM (2011)
Keyphrases
- document collections
- inverted file
- document retrieval
- information retrieval systems
- test collection
- information retrieval
- text retrieval
- digital libraries
- scatter gather
- relevant documents
- databases
- geographic information retrieval
- topic detection
- document clustering
- index terms
- ad hoc retrieval
- cross language
- xml retrieval
- document representation
- data collections
- posting lists
- text corpora
- database
- text data
- query processing
- inverted index
- inverted lists
- metadata
- machine learning
- document archives