A Workflow for Offline Model-Free Robotic Reinforcement Learning

Aviral Kumar, Anikait Singh, Stephen Tian, Chelsea Finn, Sergey Levine

Published in: CoRL (2021)

Keyphrases

model free
reinforcement learning
reinforcement learning algorithms
function approximation
temporal difference
policy evaluation
policy iteration
rl algorithms
state space
robotic systems
reinforcement learning methods
real robot
mobile robot
optimal policy
learning algorithm
data mining
machine learning
learning process
supervised learning
robot control
multi agent
average reward
temporal difference learning
robotic arm
text classification