Login / Signup
A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents.
Sukai Huang
Nir Lipovetzky
Trevor Cohn
Published in:
CoRR (2023)
Keyphrases
</>
reinforcement learning
learning process
multi agent
learning algorithm
decision making
multi agent systems
knowledge acquisition
background knowledge
multiagent systems
complex domains
bayesian networks
learning tasks
multiple agents
function approximators