Code-switching Language Modeling With Bilingual Word Embeddings: A Case Study for Egyptian Arabic-English.
Injy HamedMoritz ZhuMohamed ElmahdySlim AbdennadherNgoc Thang VuPublished in: CoRR (2019)
Keyphrases
- language modeling
- cross lingual
- parallel corpus
- n gram
- language model
- translation model
- comparable corpora
- word sense
- unknown words
- word segmentation
- word alignment
- statistical machine translation
- cross language
- retrieval model
- character n grams
- information retrieval
- statistical language modeling
- machine translation system
- bilingual dictionaries
- language independent
- query expansion
- sentence pairs
- vector space
- multiword
- out of vocabulary
- probabilistic model
- source language
- parallel corpora
- query translation
- english chinese
- word level
- arabic documents
- low dimensional
- translation probabilities
- handwriting recognition
- document retrieval
- finite state transducers
- text classification
- term frequency
- co occurrence
- retrieval effectiveness
- statistical translation models
- word pairs
- relevance model
- cross language information retrieval
- machine translation
- tf idf
- text retrieval
- test collection
- dimensionality reduction
- feature selection