Self-Training for Unsupervised Neural Machine Translation in Unbalanced Training Data Scenarios.
Haipeng SunRui WangKehai ChenMasao UtiyamaEiichiro SumitaTiejun ZhaoPublished in: NAACL-HLT (2021)
Keyphrases
- machine translation
- training data
- supervised learning
- training set
- semi supervised learning
- semi supervised
- cross lingual
- pos tagging
- natural language processing
- co training
- unsupervised learning
- unlabeled data
- language independent
- labeled data
- information extraction
- lexical semantics
- language processing
- natural language generation
- language resources
- cross language information retrieval
- learning algorithm
- word sense disambiguation
- target language
- statistical machine translation
- machine translation system
- parallel corpora
- word alignment
- natural language
- chinese english
- parallel corpus
- expert systems
- grammar induction
- named entity recognition
- multilingual documents
- machine transliteration
- brazilian portuguese