Balancing Constraints and Rewards with Meta-Gradient D4PG.
Dan A. Calian
Daniel J. Mankowitz
Tom Zahavy
Zhongwen Xu
Junhyuk Oh
Nir Levine
Timothy A. Mann
Published in:
ICLR (2021)
Keyphrases
constraint satisfaction
real time
machine learning
reinforcement learning
constrained optimization
case study
Bayesian networks
evolutionary algorithm
Markov decision processes