Login / Signup
Fast Model-based Policy Search for Universal Policy Networks.
Buddhika Laknath Semage
Thommen George Karimpanal
Santu Rana
Svetha Venkatesh
Published in:
ICPR (2022)
Keyphrases
</>
policy search
reinforcement learning
continuous state
dynamic programming
reinforcement learning algorithms
policy gradient
markov decision problems
model free
random walk
dynamic environments
markov decision processes
function approximation
action space
function approximators