Login / Signup
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization.
Uri Gadot
Esther Derman
Navdeep Kumar
Maxence Mohamed Elfatihi
Kfir Levy
Shie Mannor
Published in:
CoRR (2023)
Keyphrases
</>
reinforcement learning
markov decision processes
reward function
semi markov decision processes
markov decision problems
state space
average reward
finite state
reinforcement learning algorithms
rank minimization
factored mdps
sequential decision making problems