Login / Signup
Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization.
Uri Gadot
Esther Derman
Navdeep Kumar
Maxence Mohamed Elfatihi
Kfir Levy
Shie Mannor
Published in:
AAAI (2024)
Keyphrases
</>
reinforcement learning
semi markov decision processes
markov decision processes
average reward
markov decision problems
reward function
robust statistics
half quadratic
multi agent
state space
low frequency
norm minimization
factored markov decision processes