Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning.

Kun Huang, Edward S. Hu, Dinesh Jayaraman

Published in: CoRR (2022)

Keyphrases

autonomous robots
supervised learning
robot control
reinforcement learning
state space
learning algorithm
inverse reinforcement learning
optimal policy
reward function
multi-robot
search algorithm
mobile robot
active learning
social networks
Markov decision process
state action
policy search
data mining