Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong ParkKevin FransSergey LevineAviral KumarPublished in: CoRR (2024)
Keyphrases
- reinforcement learning
- learning algorithm
- learning process
- online learning
- learning systems
- learning scheme
- learning tasks
- supervised learning
- state space
- markov decision processes
- support vector
- autonomous learning
- data sets
- probabilistic model
- learning problems
- action selection
- reinforcement learning algorithms
- reinforcement learning methods