Universal indexes for highly repetitive document collections.
Francisco ClaudeAntonio FariñaMiguel A. Martínez-PrietoGonzalo NavarroPublished in: Inf. Syst. (2016)
Keyphrases
- document collections
- inverted file
- information retrieval systems
- document retrieval
- information retrieval
- test collection
- text retrieval
- document clustering
- document representation
- text collections
- digital libraries
- index terms
- scatter gather
- database
- text corpora
- posting lists
- document archives
- relevant documents
- cross language
- xml retrieval
- topic detection
- query processing
- text data
- ad hoc retrieval
- index structure
- data collections
- information extraction
- e learning