Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm.
Qinbo Bai
Mridul Agarwal
Vaneet Aggarwal
Published in:
CoRR (2021)
Keyphrases
multi objective
joint optimization
learning algorithm
optimization algorithm
reinforcement learning
objective function
dynamic programming
optimal solution
simulated annealing
optimal policy
data sets
image processing
optimization problems
particle swarm optimization
actor critic
inverse reinforcement learning