Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks.
Katherine M. CollinsCatherine WongJiahai FengMegan WeiJosh TenenbaumPublished in: CogSci (2022)
Keyphrases
- language model
- reasoning tasks
- language modeling
- description logics
- n gram
- language modelling
- temporal reasoning
- document retrieval
- probabilistic model
- retrieval model
- statistical language models
- logic programming
- automated reasoning
- query expansion
- context sensitive
- answer set programming
- information retrieval
- speech recognition
- smoothing methods
- situation calculus
- vector space model
- pseudo relevance feedback
- relevance model
- language models for information retrieval
- query terms
- artificial intelligence
- document ranking
- bayesian networks
- general purpose
- probability distribution
- relational databases
- okapi bm
- machine learning