An Improved Arabic Word's Roots Extraction Method Using n-Gram Technique.
Nidal Yousef, Aymen M. Abu-Errub, Ashraf Odeh, Hayel Khafajeh
Published in: J. Comput. Sci. (2014)
Keyphrases
- n gram
- language model
- language independent
- character n grams
- text classification
- language modelling
- bag of words
- word segmentation
- variable length
- unknown words
- language modeling
- part of speech
- statistical language modeling
- viterbi algorithm
- inside outside algorithm
- word level
- feature selection
- speech recognition
- web documents
- out of vocabulary
- language specific
- text categorization
- hidden markov models
- finite state transducers
- data mining