Login / Signup
State Advantage Weighting for Offline RL.
Jiafei Lyu
Aicheng Gong
Le Wan
Zongqing Lu
Xiu Li
Published in:
Tiny Papers @ ICLR (2023)
Keyphrases
</>
reinforcement learning
state space
real time
multi agent
learning process
markov decision processes
real world
genetic algorithm
artificial intelligence
similarity measure
evolutionary algorithm
mobile robot
markov chain
function approximation