On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research.
Made Nindyatama NityasyaHaryo Akbarianto WibowoAlham Fikri AjiGenta Indra WinataRadityo Eko PrasojoPhil BlunsomAdhiguna KuncoroPublished in: CoRR (2023)
Keyphrases
- language model
- language modeling
- n gram
- document retrieval
- probabilistic model
- information retrieval
- natural language processing
- query expansion
- language modelling
- speech recognition
- document ranking
- statistical language models
- test collection
- retrieval model
- ad hoc information retrieval
- language model for information retrieval
- smoothing methods
- language models for information retrieval
- text mining
- mixture model
- pseudo relevance feedback
- context sensitive
- query terms
- part of speech
- translation model
- natural language
- information extraction
- machine learning
- clustering algorithm
- hidden markov models
- supervised learning
- dependency structure
- query specific
- statistical machine translation
- multiword
- relevance model
- cross lingual