The Phenomenon of Policy Churn.

Tom Schaul, André Barreto, John Quan, Georg Ostrovski

Published in: NeurIPS (2022)

Keyphrases

optimal policy
asymptotically optimal
artificial intelligence
policy making
neural network
expert systems
state space
decision process
reward function