Training Data in Statistical Machine Translation - the More, the Better?
Monica Gavrila, Cristina Vertan
Published in: RANLP (2011)
Keyphrases
- statistical machine translation
- training data
- training corpus
- machine translation
- word alignment
- language model
- decision trees
- learning algorithm
- minimum error rate
- machine translation system
- training set
- labeled data
- unlabeled data
- translation model
- chinese english
- information extraction
- cross language information retrieval
- prior knowledge
- multiword
- supervised learning
- model selection
- language independent
- target language
- search engine
- machine learning