KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus.
Thanh-Le Ha, Jan Niehues, Matthias Sperber, Ngoc-Quan Pham, Alexander H. Waibel
Published in: LREC (2018)
Keyphrases
- parallel corpus
- Chinese English
- machine translation system
- cross language information retrieval
- comparable corpora
- statistical machine translation
- parallel corpora
- language resources
- cross lingual information retrieval
- cross lingual
- cross language
- language independent
- machine translation
- user friendly
- manually annotated
- query translation
- English words
- sentence pairs
- cross language IR
- vector space
- text corpora
- bilingual dictionaries
- digital libraries
- language modeling
- text retrieval
- named entities
- WordNet
- dimensionality reduction
- high dimensional