Chaos-based reinforcement learning with TD3.
Toshitaka MatsukiYusuke SakemiKazuyuki AiharaPublished in: CoRR (2024)
Keyphrases
- reinforcement learning
- temporal difference
- reinforcement learning algorithms
- eligibility traces
- function approximation
- temporal difference learning
- state space
- td learning
- model free
- policy evaluation
- reinforcement learning methods
- evaluation function
- learning algorithm
- reinforcement learning problems
- markov decision processes
- optimal policy
- control problems
- action selection
- function approximators
- temporal difference methods
- supervised learning
- dynamic programming
- chaotic maps
- optimal control
- robotic control
- machine learning
- multi agent
- search space
- policy search
- rl algorithms
- markov chain
- partially observable