Reinforcement Learning with a Terminator.

Guy Tennenholtz Nadav Merlis Lior Shani Shie Mannor Uri Shalit Gal Chechik Assaf Hallak Gal Dalal

Published in: CoRR (2022)

Keyphrases

reinforcement learning
function approximation
machine learning
robotic control
temporal difference learning
state space
multi agent
supervised learning
markov decision processes
optimal control
reinforcement learning algorithms
function approximators
continuous state
optimal policy
learning problems
model free
reward function
control problems
learning agents
real time