Unicode Normalization and Grapheme Parsing of Indic Languages.
Md. Nazmuddoha AnsaryQuazi Adibur Rahman AdibTahsin ReasatAsif Shahriyar SushmitAhmed Imtiaz HumayunSazia Morshed MehnazKanij FatemaMohammad Mamun Or RashidFarig SadequePublished in: LREC/COLING (2024)
Keyphrases
- syntactic and semantic dependencies
- context free
- word order
- context free grammars
- grammar induction
- expressive power
- language independent
- natural language processing
- natural language
- target language
- grammatical inference
- language identification
- cross lingual
- normalization method
- natural language parsing
- multi lingual
- statistical machine translation
- word recognition
- short text
- spoken language
- neural network
- text summarization
- context dependent
- language model
- preprocessing
- bayesian networks