Retrospective on the 2021 BASALT Competition on Learning from Human Feedback.
Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas R. Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries, Alexandra Souly, Jun Shern Chan, Daniel del Castillo, Tom Lieberum
Published in: CoRR (2022)