Improved Graph-Based Bilingual Corpus Selection with Sentence Pair Ranking for Statistical Machine Translation.
Wenhan ChaoZhoujun LiPublished in: ICTAI (2011)
Keyphrases
- sentence pairs
- training corpus
- parallel corpus
- statistical machine translation
- cross lingual
- sentence level
- parallel corpora
- multiword
- cross language information retrieval
- translation model
- part of speech
- recognizing textual entailment
- target language
- chinese english
- ranking algorithm
- document level
- text classification
- machine translation system
- linguistic features
- parallel texts
- machine translation
- semantic roles
- text corpus
- natural language
- word alignment
- noun phrases
- query translation
- word pairs
- word frequency
- information extraction