A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning.

Published in: CoRR (2021)