Bilingual Word Embeddings for Bilingual Terminology Extraction from Specialized Comparable Corpora.
Amir HazemEmmanuel MorinPublished in: IJCNLP(1) (2017)
Keyphrases
- comparable corpora
- terminology extraction
- bilingual lexicon
- word pairs
- cross language information retrieval
- parallel corpora
- language modeling
- news articles
- translation model
- bilingual dictionaries
- machine translation
- text corpora
- query translation
- parallel corpus
- text documents
- statistical machine translation
- textual data
- n gram
- cross lingual
- language model
- language independent
- machine translation system
- semantic relations
- sentence level
- tf idf
- information retrieval
- cross language
- document clustering
- text classification
- co occurrence
- text mining
- image retrieval
- keywords