Login / Signup
The Phenomenon of Policy Churn.
Tom Schaul
André Barreto
John Quan
Georg Ostrovski
Published in:
NeurIPS (2022)
Keyphrases
</>
optimal policy
asymptotically optimal
artificial intelligence
policy making
neural network
expert systems
state space
decision process
reward function