Publication: Error bounds and dynamics of bootstrapping in actor-critic reinforcement learning.