Login / Signup

Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning.

Bryan BrandtPrithviraj Dasgupta
Published in: CoRR (2023)
Keyphrases
  • reinforcement learning
  • decision making
  • training data
  • special case
  • probability distribution
  • data points
  • background knowledge
  • sequential decision making