Login / Signup
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting.
Zhang-Wei Hong
Pulkit Agrawal
Remi Tachet des Combes
Romain Laroche
Published in:
ICLR (2023)
Keyphrases
</>
reinforcement learning
uci machine learning repository
function approximation
benchmark datasets
neural network
multi agent
database
action selection
trajectory data
real time
machine learning
spatio temporal
optimal policy
markov decision processes
learning problems
raw data
data sets