Login / Signup
Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning.
Bryan Brandt
Prithviraj Dasgupta
Published in:
CoRR (2023)
Keyphrases
</>
reinforcement learning
decision making
training data
special case
probability distribution
data points
background knowledge
sequential decision making