Augmenting Statistical Machine Translation with Subword Translation of Out-of-Vocabulary Words.
Nelson F. LiuJonathan MayMichael PustKevin KnightPublished in: CoRR (2018)
Keyphrases
- out of vocabulary
- statistical machine translation
- parallel corpora
- cross language information retrieval
- language model
- machine translation
- chinese english
- spoken document retrieval
- translation model
- cross lingual
- machine translation system
- n gram
- query translation
- word alignment
- word segmentation
- language independent
- parallel corpus
- cross language
- language modeling
- speech recognition
- bilingual dictionaries
- multiword
- query terms
- broadcast news
- document retrieval
- source language
- named entity recognition
- context sensitive
- information extraction
- named entities
- query expansion
- test collection
- word pairs