Multilingual Financial Word Embeddings for Arabic, English and French.
Nadhem Zmandar, Mahmoud El-Haj, Paul Rayson
Published in: IEEE BigData (2021)
Keyphrases
- language specific
- parallel corpus
- unknown words
- cross lingual
- cross language
- mono lingual
- language independent
- arabic documents
- language identification
- indian languages
- cross language information retrieval
- character n grams
- machine translation
- arabic language
- word forms
- n gram
- bilingual dictionaries
- machine translation system
- cross language ir
- translation model
- statistical machine translation
- natural language
- english words
- english text
- word segmentation
- word sense
- spoken document retrieval
- chinese english
- morphological analysis
- multilingual retrieval
- word alignment
- stop words
- monolingual retrieval
- word level
- language resources
- query translation
- vector space
- source language
- computing semantic relatedness
- cross lingual information retrieval
- dublin city university
- compound words
- word spotting
- grammar induction
- multilingual information retrieval
- handwritten documents
- handwritten words
- text classification
- printed documents
- document collections
- foreign language
- text retrieval
- language modeling
- co occurrence
- bilingual lexicon
- document retrieval
- multiword
- training corpus
- out of vocabulary
- word sense disambiguation
- part of speech
- target language
- document images
- named entity recognizer
- lexical information