Training Data Modification for SMT Considering Groups of Synonymous Sentences.
Hideki KashiokaPublished in: EMSEE@ACL (2005)
Keyphrases
- training data
- word alignment
- decision trees
- test set
- data sets
- training set
- classification accuracy
- prior knowledge
- learning algorithm
- training corpus
- natural language
- multi document summarization
- training examples
- noisy data
- machine learning
- text summarization
- training process
- machine translation
- class labels
- training samples
- support vector machine
- domain knowledge
- test data
- training dataset
- group members
- unlabeled data
- statistical machine translation
- machine translation system