Linguistically-Based Sub-Sentential Alignment for Terminology Extraction from a Bilingual Automotive Corpus.
Lieve Macken, Els Lefever, Véronique Hoste
Published in: COLING (2008)
Keyphrases
- terminology extraction
- cross language information retrieval
- parallel texts
- comparable corpora
- sentence pairs
- parallel corpora
- parallel corpus
- machine translation
- word alignment
- sentence level
- statistical machine translation
- query translation
- word pairs
- linguistic resources
- news articles
- text corpora
- textual data
- cross language
- machine translation system
- text corpus
- translation model
- bilingual dictionaries
- document clustering
- information retrieval
- language modeling
- query terms
- information extraction
- digital libraries