Learning to Play No-Press Diplomacy with Best Response Policy Iteration.
Thomas W. AnthonyTom EcclesAndrea TacchettiJános KramárIan M. GempThomas C. HudsonNicolas PorcelMarc LanctotJulien PérolatRichard EverettSatinder SinghThore GraepelYoram BachrachPublished in: NeurIPS (2020)