Action space noise optimization as exploration in deterministic policy gradient for locomotion tasks.
Hesan Nobakht, Yong Liu
Published in: Appl. Intell. (2022)
Keyphrases
- action space
- policy gradient
- reinforcement learning
- state action
- reinforcement learning methods
- single agent
- markov decision processes
- state space
- action selection
- multi agent
- function approximators
- reinforcement learning algorithms
- dynamic environments
- function approximation
- optimization methods
- real valued
- decision problems
- computational complexity
- multi agent systems
- optimal solution