Login / Signup
Value-Penalized Auxiliary Control from Examples for Learning without Rewards or Demonstrations.
Trevor Ablett
Bryan Chan
Jayce Haoran Wang
Jonathan Kelly
Published in:
CoRR (2024)
Keyphrases
</>
learning algorithm
learning process
reinforcement learning
learning tasks
active learning
supervised learning
positive examples
data sets
learning problems
machine learning
learning environment
prior knowledge
semi supervised
control method
control rules
credit assignment