Building bilingual lexicon to create Dialect Tunisian corpora and adapt language model.
Rahma BoujelbaneMariem Ellouze KhemakhemSiwar BenAyedLamia Hadrich BelguithPublished in: HyTra@ACL (2013)
Keyphrases
- language model
- comparable corpora
- language modeling
- bilingual lexicon
- statistical machine translation
- document retrieval
- retrieval model
- parallel corpora
- translation model
- probabilistic model
- n gram
- information retrieval
- cross language
- cross lingual
- cross language information retrieval
- query expansion
- test collection
- pseudo relevance feedback
- machine translation
- relevance model
- vector space model
- document representation
- query terms
- context sensitive
- co occurrence