Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena.

Published in: CoRR (2024)

Keyphrases