Login / Signup
Learning Cooperative Multi-Agent Policies with Multi-Channel Reward Curriculum Based Q-Learning.
Jayant Singh
Jing Zhou
Baltasar Beferull-Lozano
Ilya Tyapin
Published in:
IECON (2022)
Keyphrases
</>
multi channel
reinforcement learning
learning algorithm
learning process
function approximation
high level
professional development
multi agent
optimal policy
learning agent
state action
eligibility traces
anti aliasing