Sign in

Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward.

Tengyu XuYingbin Liang
Published in: CoRR (2022)
Keyphrases
  • reinforcement learning
  • state space
  • cost effective
  • database
  • real time
  • data structure
  • dynamic programming
  • lightweight
  • markov decision processes
  • reinforcement learning algorithms