Reinforcement Learning with a Terminator.
Guy TennenholtzNadav MerlisLior ShaniShie MannorUri ShalitGal ChechikAssaf HallakGal DalalPublished in: NeurIPS (2022)
Keyphrases
- reinforcement learning
- function approximation
- state space
- learning algorithm
- model free
- reinforcement learning algorithms
- machine learning
- dynamic programming
- information retrieval
- multi agent
- learning process
- transition model
- optimal policy
- stochastic approximation
- partially observable domains
- evolutionary learning
- policy search
- perceptual aliasing
- robotic control
- real time
- temporal difference learning
- learning agent
- learning problems
- markov chain
- supervised learning
- case study
- databases
- data sets