Scalable techniques for document identifier assignment ininverted indexes.
Shuai DingJosh AttenbergTorsten SuelPublished in: WWW (2010)
Keyphrases
- web documents
- document collections
- database
- document images
- retrieval systems
- document classification
- inverted index
- information retrieval
- information retrieval systems
- document identifiers
- highly scalable
- text documents
- structured documents
- document retrieval
- document processing
- inverted lists
- web scale
- physical database design
- relevant documents
- tf idf
- index structure
- multi dimensional
- data structure
- machine learning
- databases