Comparison of Stemming and N-gram Matching for Term Conflation in Arabic Text.
Hani Abu-Salem
Published in: Int. J. Comput. Process. Orient. Lang. (2004)
Keyphrases
- n-gram
- Arabic text
- language model
- text classification
- variable length
- bag of words
- language independent
- language modelling
- information retrieval
- character n-grams
- Viterbi algorithm
- language modeling
- inside-outside algorithm
- part of speech
- word segmentation
- probabilistic model
- machine learning
- document representation
- web documents
- natural language processing
- information extraction
- statistical language modeling