Settling the Reward Hypothesis

Michael Bowling, John D. Martin, David Abel, Will Dabney

Published in: CoRR (2022)

Keyphrases

reinforcement learning
real time
neural network
information technology
data sets
genetic algorithm
search algorithm
relational databases
wireless sensor networks
long run
reward function
hypothesis space
average reward