Consistent Improvement in Translation Quality of Chinese-Japanese Technical Texts by Adding Additional Quasi-parallel Training Data.
Wei YangYves LepagePublished in: WAT (2014)
Keyphrases
- training data
- chinese texts
- japanese language
- learning algorithm
- decision trees
- high quality
- supervised learning
- training corpus
- quality improvement
- native speakers
- domain knowledge
- data sets
- parallel processing
- chinese characters
- prior knowledge
- chinese english
- data quality
- chinese text
- english words
- translation model
- text documents
- training examples
- co occurrence
- machine learning