s-grams: Defining generalized n-grams for information retrieval.
Anni JärvelinAntti JärvelinKalervo JärvelinPublished in: Inf. Process. Manag. (2007)
Keyphrases
- n gram
- information retrieval
- language model
- language modeling
- language independent
- language modelling
- relevance ranking
- variable length
- information retrieval systems
- text classification
- retrieval model
- bag of words
- statistical language modeling
- vector space model
- query expansion
- search engine
- document retrieval
- part of speech
- probabilistic model
- viterbi algorithm
- real world
- artificial intelligence
- neural network
- relevant documents
- query terms
- text retrieval
- test collection
- web documents
- document representation
- document collections
- text categorization
- information extraction
- machine learning
- data mining