BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages.
Manuel Mager, Arturo Oncevay, Elisabeth Mager, Katharina Kann, Ngoc Thang Vu
Published in: ACL (Findings) (2022)
Keyphrases
- machine translation
- morphological segmentation
- target language
- language independent
- cross lingual
- statistical machine translation
- multilingual documents
- language specific
- language resources
- parallel corpora
- machine translation system
- query translation
- source language
- multilingual information retrieval
- natural language processing
- cross language information retrieval
- language processing
- natural language
- machine readable dictionaries
- comparable corpora
- Chinese-English
- bilingual dictionaries
- information extraction
- word sense disambiguation
- watershed transform
- word alignment
- grammar induction
- word level
- cross lingual information retrieval
- linguistic resources
- parallel corpus
- word order
- natural language generation
- watershed segmentation
- cross language
- n-gram
- text classification
- Brazilian Portuguese