Login / Signup
Policy Learning for Off-Dynamics RL with Deficient Support.
Linh Le Pham Van
Hung The Tran
Sunil Gupta
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
learning process
learning systems
learning problems
dynamical systems
learning algorithm
learning scenarios
policy gradient
partially observable domains
optimal policy
state space
learning tasks
dynamic model
adaptive control
complex domains
temporal difference learning
actor critic