Efficient indexing of repeated n-grams.
Samuel J. Huston, Alistair Moffat, W. Bruce Croft
Published in: WSDM (2011)
Keyphrases
- n gram
- efficient indexing
- language model
- content based retrieval
- text classification
- bag of words
- similarity search
- part of speech
- variable length
- inside outside algorithm
- database
- document retrieval
- web documents
- information retrieval
- question answering
- multimedia databases
- suffix tree
- neural network
- query expansion
- multi dimensional
- text mining
- language specific