Semi-Automatic Parallel Corpora Extraction from Comparable News Corpora.
Thoudam Doren SinghSivaji BandyopadhyayPublished in: Polibits (2010)
Keyphrases
- semi automatic
- parallel corpora
- labor intensive
- comparable corpora
- news articles
- fully automatic
- machine translation
- cross language information retrieval
- parallel texts
- cross lingual
- language independent
- statistical machine translation
- semi automatically
- information extraction
- sentence pairs
- bilingual dictionaries
- domain ontology
- parallel corpus
- wikipedia articles
- machine translation system
- query translation
- cross language
- domain knowledge
- word pairs
- domain specific