Evaluating Language-Model Agents on Realistic Autonomous Tasks

Megan Kinniment, Lucas Jun Koba Sato, Haoxing Du, Brian Goodrich, Max Hasin, Lawrence Chan, Luke Harold Miles, Tao R. Lin, Hjalmar Wijk, Joel Burget, Aaron Ho, Elizabeth Barnes, Paul Christiano
Published in: CoRR (2023)
Keyphrases