Stemming and N-gram matching for term conflation in Turkish texts.
F. Çuna EkmekçiogluMichael F. LynchPeter WillettPublished in: Inf. Res. (1996)
Keyphrases
- n gram
- language model
- language independent
- text classification
- bag of words
- variable length
- part of speech
- language modelling
- language modeling
- character n grams
- query terms
- graph matching
- text documents
- word segmentation
- viterbi algorithm
- natural language
- neural network
- cross language information retrieval
- term dependencies
- text mining