Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle.

Published in: CoRR (2019)

Keyphrases