Stemming and n-grams in Spanish: an evaluation of their impact on information retrieval.
Carlos G. FiguerolaRaquel Gómez DíazEva López de San RománPublished in: J. Inf. Sci. (2000)
Keyphrases
- n gram
- language model
- information retrieval
- language modeling
- bag of words
- question answering
- language independent
- text classification
- language modelling
- character n grams
- document retrieval
- part of speech
- statistical language modeling
- query expansion
- text mining
- word segmentation
- probabilistic model
- viterbi algorithm
- variable length
- test collection
- relevance ranking
- web documents
- information access
- co occurrence
- retrieval model
- pseudo relevance feedback
- databases
- logic programs
- image classification