Exploiting Symmetry in Dynamics for Model-Based Reinforcement Learning with Asymmetric Rewards.
Yasin Sonmez
Neelay Junnarkar
Murat Arcak
Published in:
CoRR (2024)
Keyphrases
model based reinforcement learning
markov decision processes
reinforcement learning
state space
finite state
optimal policy
dynamic programming
dynamical systems
decision processes
partially observable
reward function
policy iteration
average cost
action space