MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset.

Weiqi Wang, Yangqiu Song
Published in: CoRR (2024)