Policy Learning for Off-Dynamics RL with Deficient Support.

Linh Le Pham Van, Hung The Tran, Sunil Gupta

Published in: CoRR (2024)

Keyphrases

reinforcement learning
learning process
learning systems
learning problems
dynamical systems
learning algorithm
learning scenarios
policy gradient
partially observable domains
optimal policy
state space
learning tasks
dynamic model
adaptive control
complex domains
temporal difference learning
actor critic