Blending MPC & Value Function Approximation for Efficient Reinforcement Learning.

Mohak Bhardwaj Sanjiban Choudhury Byron Boots

Published in: CoRR (2020)

Keyphrases

reinforcement learning
state space
temporal difference learning
multi agent systems
computationally expensive
reinforcement learning algorithms
neural network
multi agent
mobile robot