Style-Agnostic Reinforcement Learning.

Juyong Lee Seokjun Ahn Jaesik Park

Published in: CoRR (2022)

Keyphrases

reinforcement learning
function approximation
state space
reinforcement learning algorithms
temporal difference
model free
markov decision processes
machine learning
learning algorithm
multi agent
databases
robotic control
function approximators
learning process
database
action selection
optimal control
optimal policy
real robot
direct policy search