Balancing Constraints and Rewards with Meta-Gradient D4PG.
Dan A. Calian
Daniel J. Mankowitz
Tom Zahavy
Zhongwen Xu
Junhyuk Oh
Nir Levine
Timothy A. Mann
Published in:
CoRR (2020)
Keyphrases
reinforcement learning
information retrieval
global constraints
real time
search algorithm
constraint satisfaction
meta level
data sets
computer vision
information systems
evolutionary algorithm
Markov decision processes
constrained optimization
credit assignment