Login / Signup
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification.
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
Published in:
CoRR (2021)
Keyphrases
</>
policy search
reinforcement learning
support vector
support vector machine
feature space
support vector machine svm
markov decision problems