How Effective is Byte Pair Encoding for Out-Of-Vocabulary Words in Neural Machine Translation?
Ali AraabiChristof MonzVlad NiculaePublished in: CoRR (2022)
Keyphrases
- machine translation
- out of vocabulary
- cross language information retrieval
- language specific
- chinese english
- parallel corpora
- cross lingual
- english chinese
- word sense disambiguation
- word level
- n gram
- language independent
- word segmentation
- spoken document retrieval
- language model
- named entity recognition
- query words
- word alignment
- machine translation system
- broadcast news
- natural language processing
- information extraction
- parallel corpus
- natural language
- statistical machine translation
- language processing
- query translation
- target language
- named entities
- hand crafted
- cross language
- text retrieval
- keywords