Collecting and Using Comparable Corpora for Statistical Machine Translation.
Inguna SkadinaAhmet AkerNikos MastropavlosFangzhong SuDan TufisMateja VerlicAndrejs VasiljevsBogdan BabychPaul D. CloughRobert J. GaizauskasNikos GlarosMonica Lestari ParamitaMarcis PinnisPublished in: LREC (2012)
Keyphrases
- comparable corpora
- cross language information retrieval
- news articles
- parallel corpora
- language modeling
- bilingual lexicon
- machine translation
- word pairs
- text corpora
- cross lingual
- bilingual dictionaries
- query translation
- bi directional
- text documents
- cross language
- data mining
- knowledge discovery
- text mining
- translation model
- wikipedia articles
- labor intensive
- language independent
- text classification
- n gram
- statistical model