Aligning the Un-Alignable - A Pilot Study Using a Noisy Corpus of Nonstandardized, Semi-parallel Texts.
Florian Petran
Published in: CICLing (2) (2012)
Keyphrases
- pilot study
- parallel texts
- manually annotated
- cross-language information retrieval
- parallel corpora
- statistical machine translation
- machine translation system
- parallel corpus
- lexico-syntactic
- ground truth
- relation extraction
- machine translation
- computer games
- bilingual dictionaries
- co-occurrence
- automatically generated
- word pairs
- cross-lingual
- test collection
- word alignment
- domain specific
- information extraction
- image retrieval