Reinforcement Learning with a Terminator.
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal
Published in: CoRR (2022)
Keyphrases
- reinforcement learning
- function approximation
- machine learning
- robotic control
- temporal difference learning
- state space
- multi agent
- supervised learning
- Markov decision processes
- optimal control
- reinforcement learning algorithms
- function approximators
- continuous state
- optimal policy
- learning problems
- model free
- reward function
- control problems
- learning agents
- real time