N-gram Statistical Stemmer for Bangla Corpus.
Rabeya Sadia, Md. Ataur Rahman, Md. Hanif Seddiqui
Published in: CoRR (2019)
Keyphrases
- n gram
- language model
- statistical machine translation
- test set
- language independent
- text classification
- statistical language modeling
- variable length
- language modelling
- language modeling
- bag of words
- viterbi algorithm
- part of speech
- information retrieval
- word segmentation
- speech recognition
- inside outside algorithm
- search engine
- multiword
- web documents