Is High Variance Unavoidable in RL? A Case Study in Continuous Control.
Johan BjorckCarla P. GomesKilian Q. WeinbergerPublished in: CoRR (2021)
Keyphrases
- reinforcement learning
- control problems
- wide range
- low variance
- control strategy
- control policy
- optimal control
- control system
- high precision
- adaptive control
- data acquisition
- markov decision processes
- real time
- state space
- learning process
- multi agent
- case study
- information systems
- data sets
- action space
- control policies