Harvesting Multi-Word Expressions from Parallel Corpora.
Spela Vintar, Darja Fiser
Published in: LREC (2008)
Keyphrases
- multiword
- parallel corpora
- bilingual dictionaries
- statistical machine translation
- context sensitive
- machine translation
- cross language information retrieval
- language independent
- natural language
- machine translation system
- language model
- labor intensive
- cross lingual
- part of speech
- query translation
- text clustering
- word pairs
- wikipedia articles
- n gram
- semantic content
- training corpus
- domain knowledge
- information retrieval