Self-Training for Unsupervised Neural Machine Translation in Unbalanced Training Data Scenarios.
Haipeng SunRui WangKehai ChenMasao UtiyamaEiichiro SumitaTiejun ZhaoPublished in: CoRR (2020)
Keyphrases
- machine translation
- training data
- training set
- supervised learning
- semi supervised
- semi supervised learning
- unsupervised learning
- pos tagging
- natural language processing
- language independent
- co training
- cross lingual
- target language
- labeled data
- statistical machine translation
- language processing
- unlabeled data
- word sense disambiguation
- language resources
- learning algorithm
- lexical semantics
- natural language generation
- information extraction
- word level
- natural language
- training corpus
- brazilian portuguese
- cross language information retrieval
- query translation
- chinese english
- machine learning
- word alignment
- grammar induction
- machine translation system
- active learning
- pairwise
- knowledge base
- artificial intelligence
- multilingual documents
- data mining