Beyond Benchmarking: A New Paradigm for Evaluation and Assessment of Large Language Models.

Jin Liu, Qingquan Li, Wenlong Du
Published in: CoRR (2024)
Keyphrases