Login / Signup
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning.
Kun Huang
Edward S. Hu
Dinesh Jayaraman
Published in:
CoRR (2022)
Keyphrases
</>
autonomous robots
supervised learning
robot control
reinforcement learning
state space
learning algorithm
inverse reinforcement learning
optimal policy
reward function
multi robot
search algorithm
mobile robot
active learning
social networks
markov decision process
state action
policy search
data mining