Towards Burmese (Myanmar) Morphological Analysis: Syllable-based Tokenization and Part-of-speech Tagging.
Chenchen DingHnin Thu Zar AyeWin Pa PaKhin Thandar NwetKhin Mar SoeMasao UtiyamaEiichiro SumitaPublished in: ACM Trans. Asian Low Resour. Lang. Inf. Process. (2020)
Keyphrases
- morphological analysis
- n gram
- character n grams
- manually constructed
- named entities
- biomedical text
- biomedical information retrieval
- part of speech
- unknown words
- language model
- context free grammars
- variable length
- metadata
- pseudo relevance feedback
- language modeling
- word level
- bag of words
- text classification
- probabilistic model
- similarity measure