Highway Reinforcement Learning.
Yuhui Wang, Miroslav Strupl, Francesco Faccio, Qingyuan Wu, Haozhe Liu, Michal Grudzien, Xiaoyang Tan, Jürgen Schmidhuber
Published in: CoRR (2024)
Keyphrases
- reinforcement learning
- state space
- model free
- Markov decision processes
- reinforcement learning algorithms
- temporal difference
- function approximation
- data sets
- machine learning
- neural network
- direct policy search
- robotic control
- evolutionary learning
- robot control
- control problems
- optimal policy
- supervised learning
- learning process
- Bayesian networks
- decision making
- fitted Q iteration
- learning problems
- data mining
- information retrieval
- perceptual aliasing
- multi-agent reinforcement learning
- learning algorithm
- temporal difference learning
- function approximators
- action space
- partially observable
- dynamic programming