A Large and Diverse Arabic Corpus for Language Modeling.
Abbas Raza Ali, Muhammad Ajmal Siddiqui, Rema Algunaibet, Hasan Raza Ali
Published in: KES (2023)
Keyphrases
- language modeling
- language model
- information retrieval
- retrieval model
- query expansion
- n-gram
- cross-lingual
- probabilistic model
- comparable corpora
- statistical machine translation
- text classification
- document retrieval
- test collection
- multiword
- document level
- improvements in retrieval effectiveness
- word segmentation
- handwriting recognition
- translation model
- statistical language modeling
- text corpora
- relevance model
- search engine