Rare Word Translation Extraction from Aligned Comparable Documents.
Emmanuel ProchassonPascale FungPublished in: ACL (2011)
Keyphrases
- bilingual lexicon
- source language
- parallel corpus
- word frequencies
- parallel corpora
- word spotting
- bilingual dictionaries
- comparable corpora
- multiword
- statistical machine translation
- machine translation
- word pairs
- translation model
- text corpus
- target language
- machine translation system
- information retrieval
- keywords
- training corpus
- web documents
- character n grams
- word frequency
- information retrieval systems
- document collections
- latent topics
- text documents
- relevant documents
- printed documents
- natural language text
- term weighting
- document retrieval
- english words
- information extraction
- cross lingual
- query words
- cross language information retrieval
- page layout
- xml documents
- co occurrence
- sentence level
- linguistic information
- n gram
- concept space
- noun phrases
- word alignment
- spoken documents
- related words
- cross language
- arabic documents
- term frequency
- word sense disambiguation
- stop words
- related documents
- language model
- question answering