M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models.

Wai-Chung Kwan, Xingshan Zeng, Yufei Wang, Yusen Sun, Liangyou Li, Lifeng Shang, Qun Liu, Kam-Fai Wong
Published in: CoRR (2023)
Keyphrases
  • language model
  • multi task
  • multi domain
  • context sensitive
  • probabilistic model
  • language modeling
  • n gram
  • learning tasks
  • information retrieval
  • multi class
  • general purpose
  • model selection
  • multi task learning