Login / Signup
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation.
Zhihan Liu
Yufeng Zhang
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
Published in:
ICML (2022)
Keyphrases
</>
function approximation
reinforcement learning
function approximators
temporal difference learning algorithms
temporal difference
radial basis function
optimal policy
learning tasks
policy gradient
reinforcement learning problems
policy evaluation
pairwise
policy search