Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model.
Leonie WeissweilerValentin HofmannAnjali KantharubanAnna CaiRitam DuttAmey HengleAnubha KabraAtharva KulkarniAbhishek VijayakumarHaofei YuHinrich SchützeKemal OflazerDavid R. MortensenPublished in: CoRR (2023)
Keyphrases
- language model
- language modeling
- cross lingual
- n gram
- language modelling
- probabilistic model
- document retrieval
- information retrieval
- retrieval model
- statistical language models
- query expansion
- translation model
- language independent
- speech recognition
- context sensitive
- test collection
- vector space model
- ad hoc information retrieval
- digital libraries
- multiword
- cross language
- pseudo relevance feedback
- document length
- cross language information retrieval
- query terms
- mixture model
- relevance model
- cross lingual information retrieval
- web search
- language models for information retrieval