IndoKEPLER, IndoWiki, and IndoLAMA: A Knowledge-enhanced Language Model, Dataset, and Benchmark for the Indonesian Language.
Inigo Ramli, Adila Alfa Krisnadhi, Radityo Eko Prasojo
Published in: IWBIS (2022)
Keyphrases
- language model
- language modeling
- n-gram
- speech recognition
- document retrieval
- language modelling
- translation model
- retrieval model
- context sensitive
- probabilistic model
- information retrieval
- test collection
- statistical language models
- mixture model
- query expansion
- statistical machine translation
- query terms
- ad hoc information retrieval
- pseudo relevance feedback
- target language
- query specific
- language models for information retrieval
- smoothing methods
- natural language processing
- prior knowledge