Sign in

Minimax Weight Learning for Absorbing MDPs.

Fengyin LiYuqiang LiXianyi Wu
Published in: CoRR (2023)
Keyphrases
  • reinforcement learning
  • learning process
  • learning algorithm
  • learning systems
  • online learning
  • markov decision processes
  • machine learning
  • state space
  • knowledge acquisition
  • evaluation function