Login / Signup
Rank2Reward: Learning Shaped Reward Functions from Passive Video.
Daniel Yang
Davin Tjia
Jacob Berg
Dima Damen
Pulkit Agrawal
Abhishek Gupta
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
inverse reinforcement learning
reward function
prior knowledge
learning algorithm
image segmentation
pairwise
supervised learning
video sequences
optimal policy