Login / Signup

Rank2Reward: Learning Shaped Reward Functions from Passive Video.

Daniel YangDavin TjiaJacob BergDima DamenPulkit AgrawalAbhishek Gupta
Published in: CoRR (2024)
Keyphrases
  • reinforcement learning
  • inverse reinforcement learning
  • reward function
  • prior knowledge
  • learning algorithm
  • image segmentation
  • pairwise
  • supervised learning
  • video sequences
  • optimal policy