Universal Indexes for Highly Repetitive Document Collections.
Francisco ClaudeAntonio FariñaMiguel A. Martínez-PrietoGonzalo NavarroPublished in: CoRR (2016)
Keyphrases
- document collections
- inverted file
- information retrieval systems
- document retrieval
- information retrieval
- text retrieval
- test collection
- digital libraries
- document clustering
- scatter gather
- databases
- geographic information retrieval
- cross language
- query processing
- database
- ad hoc retrieval
- posting lists
- text data
- document representation
- relevant documents
- index terms
- topic detection
- data collections
- document archives
- document clusters
- indexing techniques