Parallel Texts Extraction from Multimodal Comparable Corpora.
Haithem Afli, Loïc Barrault, Holger Schwenk
Published in: JapTAL (2012)
Keyphrases
- parallel corpora
- comparable corpora
- cross language information retrieval
- machine translation
- bilingual dictionaries
- labor intensive
- cross lingual
- query translation
- language independent
- word pairs
- parallel corpus
- information extraction
- automatic extraction
- machine translation system
- cross language
- information retrieval
- fully automated
- statistical machine translation
- sentence level
- news articles
- translation model
- text retrieval
- query terms
- query expansion
- similarity measure