Blending MPC & Value Function Approximation for Efficient Reinforcement Learning.
Mohak Bhardwaj
Sanjiban Choudhury
Byron Boots
Published in:
CoRR (2020)
Keyphrases
reinforcement learning
state space
temporal difference learning
multi agent systems
computationally expensive
reinforcement learning algorithms
neural network
multi agent
mobile robot