A Cross-Lingual Word Kernel SVM for SMT Training Corpus Selection.
Xiwu Han
Published in: CSIE (2) (2009)
Keyphrases
- training corpus
- statistical machine translation
- cross lingual
- translation model
- support vector
- machine translation
- text classification
- word alignment
- language modeling
- cross language
- feature space
- language independent
- machine translation system
- language model
- knn
- parallel corpora
- feature selection
- training data
- sentiment classification
- query translation
- cross language information retrieval
- transfer learning
- target language
- feature vectors
- source language
- machine learning
- news articles
- document clustering
- bag of words
- co occurrence
- probabilistic model
- training set